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Abstract. The decimal expansion real numbers, familiar to us all, has a dramatic 
generalization to representation of dynamical system orbits by symbolic sequences. 
The natural way to associate a symbolic sequence with an orbit is to track its history 
through a partition. But in order to get a useful symbolism, one needs to construct 
a partition with special properties. In this work we develop a general theory of 
representing dynamical systems by symbolic systems by means of so-called Markov 
partitions. We apply the results to one of the more tractable examples: namely 
hyperbolic automorphisms of the two dimensional torus. While there are some results 
in higher dimensions, this area remains a fertile one for research. 



1. Introduction. 

We address the question: how and to what extent can a dynamical system be 
represented by a symbohc one? This has roots in a concept of basic arithmetic 
familiar to us all: namely, the representation of real numbers by infinite binary 
expansions. As the example of Section 3.2 shows this is related to partitions with 
special properties which is the theme of this work. 

K. Berg [B] in his Ph.D thesis discovered partitions (now called Markov) of the 
two dimensional torus which give rise to discrete time Markov processes under the 
action of hyperbolic automorphisms. Shortly after the B. Weiss and the author 
[AW] proved that such automorphisms are measure theoretically isomorphic if and 
only if they have the same entropy. Their proof was based on two ideas: 

(1) symbolic representations of dynamical systems by means of Markov parti- 
tions; 

(2) coding between symbolic systems having equal entropy. 

Each of these two aspects has undergone extensive development since. The present 
work is concerned with a systematic treatment of the first idea within the framework 
of point set topology: that is, we develop the notion of a discrete time topologi- 
cal Markov process without any recourse to measure theory and use it to obtain 
symbolic representations of dynamical systems. A future paper is planned expand- 
ing on the second item above incorporating it in a general isomorphism theory of 
abstract dynamical systems. 
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A disquieting aspect of the result of Adler- Weiss was a certain vagueness where 
one expects certainty: namely, not quite knowing how to compute the numerical 
entries of an certain integral matrices proven to exist. Theorem 8.4, the main one 
of Section 8, is an improvement on the results of Berg and Adler- Weiss, and does 
not suffer from this difficulty. The proof involves four cases: Case I, the simplest, 
was done by Anthony Manning many years ago as the author learned from Peter 
Walters and still may be unpublished. 
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In Section 2, we briefly introduce the concept of an abstract dynamical system 
and then go on to give three important concrete examples of such systems: namely 
multiplication maps, toral automorphisms, and symbolic shifts. 

In Section 3, we discuss symbolic representations of dynamical systems and il- 
lustrate them for the three concrete systems introduced in the previous section. 

In Section 4, we present some general notions needed for our theory of symbolic 
representation of dynamical systems. 

In Section 5, we introduce the notion of topological partition, and show how one 
gets a symbolic representation from such an object. In order to simplify notation, 
an improvement in the choice of elements for such partitions was suggested by D 
Lind: namely replacing "proper sets with disjoint interiors" , a proper set being one 
that is the closure of its interior, by "disjoint open sets whose closures cover the 
space" . There is a difference between an open set and the interior of its closure, and 
exploiting this seemingly slight difference leads not only to notational conveniences 
but also to pleasant simplifications in subsequent proofs. 

In Section 6 we define topological Markov partitions and prove Theorem 6.5, 
the main theorem of this work. This result concerns getting by means of Markov 
partitions the best that can be expected as far as symbolic representations of a 
dynamical systems is concerned. Also in this section we prove a converse to the 
main theorem. Theorem 6.10, by which one gets Markov partitions from symbolic 
representations. This leads to the question: does one construct Markov partitions 
to get symbolic representations or does one produce a symbolic representations to 
get Markov partitions? The answer to this riddle as far as the current evidence 
seems to indicate is discussed in Section 9. 

In Section 7, we provide results useful for constructing Markov partitions, par- 
ticularly the final theorem of the section. Theorem 7.12. 

In Section 8, we apply Theorem 7.12 to construct certain special Markov parti- 
tions for arbitrary hyperbolic automorphisms of the two dimensional torus, which 
is the content of Theorem 8.4. These partitions have the virtue that a matrix spec- 
ifying a hyperbolic automorphism is also the one that specifies a directed graph 
from which the symbolic representation is obtained. The proof we present, though 
involved, is quite elementary using mainly plane geometry. 

The spirit of this work is to rely solely on point set topology. We avoid any 
measure theory in this discussion. Perhaps a course in point set topology might 
be spiced \vp by using items in this work as exercises. In addition, our style of 
presentation is an attempt to accommodate students as well as experts. 

The research behind this work was carried out over many years, in different 
places, and with help from a number of colleagues, particularly Leopold Flatto and 
Bruce Kitchens. Work was done at the Watson Research Center, University of 
Warwick, and MSRI. Most of the research for Sections 5-7 was done in the MSRI 
1992 program in Symbolic Dynamics. 

2. Abstract and Concrete Dynamical Systems. 

At its most simplistic and abstract a dynamical system is a mathematical structure 
capable of generating orbits which evolve in discrete time. A map of a space into 
itself will achieve this. Depending on one's purpose additional structure is imposed: 
our requires some topology. Consequently, for us an abstract dynamical system is 
a pair (X, 0) where X is a compact metric space with metric, say, d{- and cf) 
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is a continuous mapping of a X into itself. We shall refer to X as the phase 
space of the dynamical system. The orhit of a point p e X is defined to be the 
sequence (</'"p)n=o,i,2,...- We shall consider systems where <f) is onto. Also we shall 
be mainly, though not exclusively, interested in invertiblc maps- i.e. where (/) is 
a homeomorphism-in which case the orhit of a point p € X is defined to be the 
bilaterally infinite sequence (0"p)„gz. For invertible maps we can speak of past, 
present, or future points of an orbit depending on whether n is negative, zero or 
positive; while for non-invertible maps there is only the present and future. 

For the above category of abstract systems, we have the following notion of total 
topological equivalence. 

Definition 2.1. Two systems (X. (6), {Y-i') arc said to be topologically conju- 
gate., {X, (j)) ~ (y, if there is a homeomorphism of X onto Y which commutes 
with (j) and ip: i.e., (f)9 = Oijj. 

We introduce some classical concrete dynamical systems. The first type is most 
elementary. Though non-invcrtible, it illustrates admirably some essential ideas 
which we shall discuss later. 

2.1 Multiplication maps. 
Let {X, /) be the system whoso phase; space is the complex numbers of modulus 
one- i.e. elements of the unit circle-acted upon by the mapping / : ^; — > 0" for 
some integer n > 1. 

For our purposes it is more convenient to consider a topologically and alge- 
braically equivalent formulation. Let X = M-fZ where M is the real line and Z the 
subgroup of integers. Recall that elements in X are cosets modulo Z. The coset 
of a; e M modulo Z, which we denote by {x}, is the set {x + z\z € Z} of lattice 
translates of x. Invoking some standard terminology, the real line M can be referred 
to as the universal cover of the the circle X. In view of the fact that Z acts as a 
group of transformations on the universal cover M, a coset is also called a Z— orbit. 
Two points x, x' in the same coset or Z— orbit are said to be equivalent mod Z. 
The metric is given by defining the distance between pairs of cosets as the smallest 
Euclidean distance between pairs of members. Recall that the coset of x + y de- 
pends only on the coset of x and that of y: that is, if x' G {x} and y' £ {y}, then 
{x' + y'} = {x + y}. Thus addition of cosets given by {x} + {y} = {x + y} is well 
defined and so is the multiplication-by-n map / : {,r} {.r} + • • • + {x} (n-times) 
= {nx}. This map is continuous with respect to the metric. It is not invertible: 
every coset {x} has n pre-images which arc {{x + m)/n}, m = 1, . . . ,n. 

The closed unit interval [0, 1] is a set referred to as a fundamental region for the 
action of Z on M. 

Definition 2.1.1. A fundamental region is defined as a closed set such that 

(1) it is the closure of its interior; 

(2) every orbit under the action has at least one member in it (this is equivalent 
to the statement that the translates of the unit interval by elements of Z 
tile M); 

(3) no point in the interior is in the same Z-orbit as another one in the closed 
region (this restriction does not apply to two boundary points- e.g. the 
points and 1 are in the same one). 

Fundamental regions are not unique: for example, the interval [1,2] is also a 
fundamental region for the action of Z on M, though not a particularly useful one. 
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One can give another equivalent reformulation of the phase space of these systems 
in terms of a fundamental region with boundary points identified. Let X be the 
closed unit interval with identified with 1. We define a metric on X by 

d{x,y) = m.ui(\x - y\,\x - y - 1|, |a; - y + 1|). 

From now on let us take the notation {x} to mean fractional part of a real number 
X. On X the map / takes the form 

f{x) = {nx}. 

Since all numbers in a coset have the same fractional part and that number is the 
unique member of the intersection of the coset and X, this new interpretation of 
{x} is consistent with the old. 

A more serious set of examples are continuous automorphisms of certain com- 
pact Abelian groups-namely the n-dimensional tori. For simplicity we restrict the 
discussion to the case of dimension two, generalization to higher dimensions being 
quite analogous. 

2.3 Toral Automorphisms. 
Consider the two dimensional torus jl? and a continuous group automorphism 

Here the universal cover of the 2-torus is M^. The description of the action of the 
integers on the real line generalizes in a straight-forward manner to the action of 
the subgroup 1? of points with integer coordinates on the universal cover M^. The 
definitions of cosets, lattice translates, orbits, addition, and the metric are quite 
similar. 

A continuous automorphism is specified by a a 2 x 2 matrix A : with integer 
entries and determinant ±1. Let 




The matrix A determines an invertible linear transformation on ^ . We represent 
the points in the plane by row vectors and the action of the linear transformation 
by right"'^ matrix multiplication. The map (/) is then defined as follows: the image 
of the coset containing (a;, y) is the one containing (ax + cy, hx + dy). This map is 
well-defined- i.e., the image does not depend on the choice of coset representative 
(x, y) because A is invertible and maps 1? onto itself. 

Some things to note. The map (/) is a homeomorphism. The coset {0} = 1? is 
a fixed point of There may of course be other fixed points. A pair (a;, y) is in a 
coset which is a fixed point if and only if it satisfies the two linear equations given 

by 

{x,y)k. = {x,y) + (m,n) 

for some pair of integers (m, n). The only solutions are rational. In addition, a coset 

is periodic imder (j) if and only if it is fixed imder some iterate A". Furthermore, if 
a coset contains a point with rational coordinates, then it is periodic which follows 
from the fact that the product of the denominators in a rational pair {x, y) bounds 



Right multiplication on row vectors turns out to be a little more convenient than left multi- 
plication on column vectors. 
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the denominators in the sequence {x, y), {x, y)A, {x, y)A'^, . . . which impHes that the 
orbit of {x, y) is finite. 

The plane is the universal cover of the 2-torus, and any closed unit square 
with lattice points as corners is a fundamental region. We shall call the one with 
its lower left corner at the origin the principal fundamental region. Like the one 
dimensional case, we can formulate the system in terms of it. Let X be the closed 
unit square with each point on one side identified with its opposite on the other. 
The coset of {x,y) intersects X in a unique point: namely, {{x},{y}). On X the 
map (j) takes the form 

<f>ix, y) = {{ax + cy}, {hx + dy}). 

Unlike the case of one dimension, other fundamental regions, as we shall see, play 
a crucial role in studying the action of automorphisms. 

Finally wc come to the most basic of concrete systems. Ultimately wc shall show 
to what extent they model others, in particular multiplication maps and hyperbolic 
toral automorphisms. 

2.4 Symbolic Shifts'^. 

Let A, called an alphabet, denote an ordered set of symbols, often taken to be 
{0, 1, . . . , A'^ — 1}. The domain of this system is the space 

(2.4.1) T.N = A^ = {s= (s„)„ez|sn e A] 

of all bi-infinite sequences of elements from a set of N symbols. One can think of 
an element of this space as a bi-infinite walk on the complete directed graph of 
N vertices which are distinctly labelled. Sometimes it is more convenient to label 
edges, in which case the picture is a single node with N oriented distinctly labelled 
loops over which to walk. In Figure 2.1 we have depicted the full 2-shift by both 
types of graph labelling. 

1 



Vertex labelled EdgelabeUed 



Figure 2.L Full 2-shift 



The shift transformation a is defined by shifting each bi-infinite sequence one 
step to the left. This is expressed by 

{<js)n = Sn+\- 



^For comprehensive treatment of this area we refer the reader to the book by Lind and Marcus 
[LM]. 
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We define the distance d{s, t) between two distinct md t as l/(|n| + 1) 

wliere n is the coordinate of smallest absolute value where they differ. Thus if 
d{s,t) < 1/n for n > then Sk = tk for — n < k < n. This metric makes the space 
E N one of the important compact ones-namely, the Cantor discontinuum- and the 
shift a homeomorphism. The symbolic system (EjVjCr) is called the full N-shift. 

Restricting the shift transformation of a full shift Sjv to a closed shift-invariant 
subspace S, we get very general kind of dynamical system {T,,a) called a subshift. 
Given a symbolic sequence s = (s„)„gz and integers m < n, we shall use the 
notation S[rn,n] to stand for the m — n+l-tuple (s^, Sm+i, ■ ■ ■ , s„). Given a symbolic 
phase space E, we call a k-tuple an allowable k-block if it equals s^^^^^k-i] ^ov some 
s e E. 

Returning to the realm of the more specific from our momentary excursion into 
the less knowablc. wc define shift of finite type, also called topological Markov shift 
, as the subshift of a full shift restricted to the set Eg of bi-infinite paths in a finite 
directed graph G derived from a complete one by possibly removing some edges. 

Usually we denote the space Eg by Ea where A is a matix of non-negative 
integers Oy denoting the number of edges leading from the i-th node to the j-th. 
The term "Markov" is derived from the resemblance to Markov chains for which 
the fly are probabilities instead of integers. One thing to note is that the ij entry 
of A" is the number of paths of length n beginning at i—th node and ending at 
the j—th. Often however A is an N x N matrix of zeroes and ones specifying a 
directed graph of N nodes (edges) according to the following: the i-th node (edge) 
is connected to the j-th, i — > j, if and only if a^- = 1. Whether dealing with nodes 
or edges, we call A a transition matrix and restate for zero-one matrices the above 
definition by 

(2.4.2) Eg = Ea = {s =(..., s„, . . . )|a.,„.,,„+i = 1, s„ S A G Z}. 

Remark.. Let (Ecc) be a topological shift given by a node-labelled directed 
graph G. Nodes from which there is no return are called transient, the rest recurrent. 
A node is transient if and only if either it has no predecessor nodes or all its 
predecessors are transient. This statement is not as circular as it seems: for the set 
of predecessors of any set of transicnit nodes, if non-empty, is a strictly smaller set 
of transient nodes. The only symbols which appear in bi-infinite sequences of Eg 
are labels of recurrent nodes. 

Figure 2. II describes the Fibonacci or golden ratio shift, so-called because the 
number of admissible n-blocks (paths of length n) are the Fibonacci numbers- 
namely, there are two 1-blocks, three 2-blocks, five 3-blocks, .... 



Figure 2. II. Fibonacci shift 



Here the space Ea is given by the matrix 

1 1 
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If wc label the first node by and and the second by 1. then only sequences of O's 
and I's with I's separated by O's are admissible. While other shifts of finite type 
can be specified by graphs with either nodes or edges labelled, there is no edge 
labelled graph for the Fibonacci shift. 

Given a node labelled graph G, we define the edge graph G^^^ by labelling the 
edges. For labels we can use the allowable 2-blocks. In general we define the higher 
edge graphs G*^"^ as follows. The alphabet consists of all allowable blocks [ai, . . . , a„] 
gotten from paths of length n on G. The transitions are defined by 

[ai, . . . ,a„] — > ...,bn] 

if and only if 6i = 02, . . . , 6n-i = On- 

There is a one-side version of the full AT-shift: namely, 

(2.4.3) S+ = {s = (so, si, . . . )|s„ e A n = 0, 1, 2, . . . }. 

On this space the shift transformation a is similarly defined: namely, (i7s)„ = s„+i 
but only for non-negative n. It acts by shifting sequences one step to the left and 
dropping the first symbol. The metric on this phase space is defined the same as 
before but absolute value signs are not needed. We also have one-side versions of 
shifts of finite type. In one-side symbolic systems, the shift transformation, like a 
multiplication map, is continuous but not invertible. 

Exercises. 

3.1 Prove that the canonical map tjj : G^"^ — > G defined by ^'[sij • • • ) Sn] = Si 
gives topological conjugacy of (S(3(„),cr) and (Sccr). 

3 Symbolic Representations. 

Shifts of finite type contain a great deal of complexity, yet are the best understood 
dynamical systems. Such symbolic dynamical systems can be used to analyze gen- 
eral discrete time ones. For example, a good symbolic representation will show how 
to identify periodic orbits, almost periodic ones, dense ones, etc.. 

Representing a general dynamical system by a symbolic one involves a funda- 
mental complication. We have two desires: we would like a continuous one-to-one 
correspondence between orbits (j/^x of the first and orbits cr"s of the second; and 
we want the shift system to be one of finite type. Unfortunately these two desires 
arc in conflict: constraints placed by topology must be observed. On one hand a 
continuous one-to-one correspondence makes X homeomorphic to a shift system. 
On the other hand a shift system is totally disconnected while X is often a smooth 
manifold. Thus for the most part we must abandon the quest of finding a topological 
conjugacy between a given dynamical system and a shift of finite type. However, 
we shall see that by sacrificing one-to-one correspondence we can still salvage a 
satisfactory symbolization of orbits. We are reminded of arithmetic in which we 
represent real numbers symbolically by decimal expansions, unique for the most 
part, but must allow two expansions for certain rationals. To do otherwise would 
just make the instructions for arithmetical operations unnecessarily complicated. 
The most natural way to associate a symbolic sequence with a point in a dynamical 
system is to track its history as illustrated in Figure 3.1 through a family of sets 
indexed by an alphabet of symbols. 
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Figure 3.1. Partitioning a dynamical system 

This is easy, but what is more difficult is to get a family for which each history 
represents just one point. It is no achievement to specify a family for which each 
history might represent more than one point. However, we must live with the 
inevitability that each point might have more than one associated histories. Having 
found a family of sets, the orbits through which determine a unique point, we want 
still more: namely, we would like the totality of sequences which arise to comprise 
a subshift of finite type. In order to do this, we must find a family with special 
properties. We shall look at some examples for guidance as to what these properties 
ought to be, and families of sets possessing them will be called Markov partitions. 

The first example is the trivial case of a dynamical system which is identical 
with its symbolic representation: namely, a topological Markov shift. 

3.1 Cylinder Set Partition for Symbolic Sequences. 
Let (SajCt) be a topological Markov shift, vertex labelled by an alphabet A. We 
form the partition C = {Ca : a G A} of elementary cylinder sets determined by 
fixing the 0-th coordinate: i.e., C'a = {s G "^a : sq = a}. Tracking the history of an 
orbit of a element s € through this partition means getting a sequence (Sn)nez 
such that a'^s G Cs„- But this sequence is s itself. Let us point out the salient 
features of this partition. 

First^, 

OO n 

n— — n 

Second, if s € Ca fl a~^Cb ^ then s € Ca and as € Cf, '■ i.e., sq = a,si = b. 
In terms of the graph, this means there is an edge from a to b. An absolutely 
obvious property of directed graphs is the following. If there is an edge from a to 
b, and an edge from b to c, then there is a path from a to c via b. This property 
can be reformulated as follows. If Ca n a~^Cb ^ and Cb fl a~^Cc 7^ then 
Ca n a~^Cb n (T~^Cc 7^ 0. This property has a length n version for arbitrary n: 
namely, n abutting edges form a path of length n + 1 and this can be reformulated 
to read that n pair- wise non-empty intersections lead to an (n -|- 1)— fold non- 
empty intersection. We shall call such a countable set of conditions for n = 2,3, ... 
the Markov property: it turns out to be a key requirement in getting the desired 
symbolic representation from a partition. 



^Prom now on we shall commit a convenient semantic error of confusing a set consisting of a 
single point with the point itself and so dispense with the surrounding braces 
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Finally, there is another important feature of the partition C = {Ca '■ a € A} : 
namely, the sets of this partition have a product structure respected by the shift 
which is described as follows. Let s e Ca - in other words, sq = a - and define two 
sets 

oo 


which we shall call the vertical through s and 



ha{s) ^ fl a-'Cs, 

— oo 

which we shall call the horizontal. A sequence s € Ca is the sole member of the 
intersection of its vertical and horizontal i.e. {s} = Va{s) D ha{s). Furthermore, 
for s,t £ Ca there is a unique sequence in the intersection of the horizontal through 
s and the vertical through t : namely, {(. . . s_2, S-i, sq = ^O) ^i, ^2, • ■ • )} = Vais) fl 
ha{t). We define a map of Co x Ca onto Co by (s, t) i-^ ha{s) fl Va{t), or rather the 
sole element of this intersection. It is easily verified that this map is continuous and 
its restriction to /ia(s) x Va{t) for any s, i G Co is a homeomorphism of ha{s) x Va{t) 
onto Ca- Finally a respects this product structure in the sense that if s e Caf\(T~^Cb 
then: 

(JVa{s) C Vb{as), 

aha{s) D hb{as). 

This last property is closely connected with the Markov one. 

The next example is based on the binary expansions of real numbers, and illus- 
trates what one should expect of a good symbolic a representation of a dynamical 
system. 

3.2 Symbolic Representation for Multiplication by Two. 
Let {X,f) be the multiphcation system where and f : x ^ {2a;}. Recall that the 
domain of the one-sided full 2-shift dynamical system (EJ^j,ct+) is the set of 
onc-sidcd infinite walks on the cdgc-labcUcd graph in Figure 2.1. We can equate a 
sequence s = {sn)nez with the binary expansion .S1S2S3 ■ • • • Consider the map tt 
from to X defined by 7r(si , S2 , • • • ) = {si/2 + S2/4 + . . . }. It is readily verified 
that 

(i) /tt ~ 7ri7+, 

(ii) TT is continuous, 

(iii) TT is onto, 

(iv) there is a bound on the number of pre- images (in this case two), 

and 

(v) there is a unique pre-image of "most" numbers (here those with binary 
expansions not ending in an infinite run of all zeros or all ones). 

The map n is not a homeomorphism, but we do have a satisfactory representation of 
the dynamical system by a one-sided 2-shift in the sense that: orbits are preserved; 
every point has at least one symbolic representative; there is a finite upper limit to 
the number of representatives of any point; and every symbolic sequence represents 
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Figure 3. II. Graph of multiplication by 2 (mod 1) 

some point. This is a example of what is known as a factor map which we shall 
formalize in §4. 

As we have led to reader to expect, there is an alternate definition of tt in terms 
of a partition. Consider TZ = {Rq = (0, 1/2), i?i = (1/2, 1)}. The elements of this 
family are disjoint open intervals whose closure cover the unit interval. The map tt 
which associates sequences with points has an alternate expression in terms of this 
family: namely, 

oo 

7r(si,s2,...)= fl Rs, n f-^Rs,) n • • • n /-"(i?,„+J. 

n=0 

Remeirk. The reader might wonder about defining n by the simpler expression 

oo 

7r(si,S2,...)= n /""(^«n+J- 

n=0 

There are cases where this would suffice, but a difficulty can arise and does here. 
In X the point which is identified with 1 is a fixed point of / which implies that 
G .f ^"-Ri for i = 0,1 and n > 0. Thus, except for the all or all 1 sequence, 
7r(si, S2, . . . ) is a set which does not consist of a singleton: it contains two real 
numbers, one of which is the fixed point 0; and this renders tt ill-defined. The most 
we can say in general is that 

oo oo 

fl R,, n /-i(i?.J n • • • n /-"(i?.„+J c f| /-"(i?.„+J 

n=0 n=0 

However, for the so-called expansive dynamical systems, when the size of partition 
elements is uniformly small enough, equality holds in which case tt would be well- 
defined (see Proposition 5.8). 

Next we consider hyperbolic automorphisms of the 2- torus. This was the first 
smooth class of invertible dynamical systems found to have Markov partitions. This 
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discovery was made by K Berg [B] in 1966 in his doctoral research. A short time 
later R. Adler and B Weiss [AW] constructed some special Markov partitions in 
order to prove that two such systems are conjugate in the measure theoretic sense 
if they have the same entropy. For these systems topological conjugacy implies 
measure conjugacy, but not conversely. We shall give a formal development the 
general two-dimensional case in a later chapter. Before making that plunge, we 
shall wet our toes with an informal discussion of one specific illustrative case. A 
rigorous proof of what we arc about to describe will be achieved by Theorem 7.13. 

3.3 Partition for a Toral Automorphism. 
Take the matrix 

[l 

which we have met before in quite a different context. Let X be the two-torus and 
(j) be given by A : that is, 

(t>{x,y) = {{x + y},x). 




^ + 

Figure 3. III. The torus and eigen-directions of A 



The matrix A has two eigenvalues: A = (1 + \/5 )/2 and /it = (1 — \/5 ) /2. Observe 
that A > 1 and — 1 < /i < 0. Associated with these eigenvalues arc the eigenvectors 
VA pointing into the first quadrant and v^ into the second. In Figure 3. Ill we have 
drawn two lines through the origin in the eigenvector directions. The action of A 
on a vector is to contract its v^-component by and expand its VA-component 
by A. Note fi is negative, which causes a direction reversal besides a contraction in 
the v^-component. We refer to the direction of va as the expanding direction and 
that of v^ as the contracting direction. 

In Figure 3. IV we draw another region with sides parallel to the expanding and 
contracting directions. That it is a fundamental region is verified by noting that 
each of the three triangles sticking out of the unit square is a translation by an 
element of of one of the three missing triangles inside. 

We call this fundamental region the principal one, and draw within it the col- 
lection of open rectangles TZ = {Ri : i = 1,2,3} as depicted in Figure 3.V. This 
family is an example of a type of partition we shall later describe as Markov. We 
label significant points using the same letters for those which are equivalent. 
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Figure 3. IV. Another fundamental region of 2-torus 

The image of this partition under the hnear transformation determined by A is 
depicted in Figure 3. VI. In drawing it the following calculations come into play: 
(1,0)A=(1,1), (1,1)A=(2,1), (2,1)A=(3,2). 

+ 




+ 



Figure 3. VI. Its image 

Along with the image we have included an outline of the original partition. Notice 
that (pRs is actually the same as i?2- Also notice how the other (t)Ri overlap Ri and 
R:^. The manner in which the image partition intersects the original partition can 
be summed up as follows: (pRi (1 Rj ^ $ according to whether j follows i in the 
edge graph in Figure 3.VII. 

The boundary of the sets in TZ, dRi = Ri — Rj , consists of various line segments 
in the va and v^ directions. The union of those of the RiS in the VA-direction is 
called the expanding boundary of the partition and those in the v^-direction, the 
contracting boundary. By lattice translations of the various bounding segments, we 
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Figure 3.V. Partition of 2-torus 

can reassemble their union into two intersecting line segments through the origin, 
oh and ad, as shown in Figure 3. VI. 

The behavior of the boundary under the action A leads to a topological Markov 
shift representation. The essential properties are that ad contains its image under 
A; whereas oh is contained in its image, or equivalently oh contains its inverse 
image. Because A preserves eigen-directions and keeps the origin fixed, it is easy 
to see that oh gets stretch over itself; but because there is a reflection involved it is 
not enough to know that the length of ad is contracted by A. We must show that 
the points a and d on the line segment ad have their images within that segment. 
These points are the projections to in the v^-eigen-direction from (0, 1) and (1, 0) 
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Figure 3. VI I. Edge graph for ^ acting on TZ 

respectively: so their images are the projections from the images of these lattice 
points which are (1,0) and (LI) respectively. Thus the image of d is a, and the 
image of a is c. These facts about the expanding and contracting boundaries imply 
that refinements of the original partition under positive iterates of (f) do not have 
any new boundary segments in the v^-direction that aren't already contained in 
ad, while under negative iterates of (j) there are no new ones in the VA-direction 
not already in ob. Prom this we obtain that for n e N a set n Rj, if non- 
empty, is a union of rectangles stretching in the expanding direction all the way 
across Rj. Similarly, a non-empty Ri fl (j)~^Rj is a union of rectangles stretching 
in the contracting direction all the way across Ri. When n = 1, it can be seen 
that these each of these unions consists of a single rectangle. This implies that if 
nfe=-n 4>~'^Rsk 7^ 0: then this intersection is a single rectangle stretching all the 
way across Rg^ in the expanding direction. Similarly, if nfe=o 4'~^I^sk then 
this set is a single rectangle stretching all the way across Rg^ in the contracting 
direction. 




Possible Impossible 

Figure 3. VIII. Intersections 

Combining these two results wc have that a non-empty closed set of the form 
nfc=_„ 4'~^Rsk is a closed rectangle. The diameter of these sets is uniformly 
bounded by constant x Thus as n ^ oo, a sequence of such sets decreases 

to a point in X. Consequently, such a point can bo reprc!scntcd by a sequence 
s = {sn)n&- If fact, all points of the torus can be so represented. 

If riL-n ^sfc 7^ 0, thenit isclearthat iis^n^ii-^iJ^.+i 7^ for -n < i < n-1. 
The converse which is the Markov property is really the main one we are extracting 
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from the geometry of this example. As we have seen Rg. n 4>^^Rsi_^_i 7^ if and only 
if edge Sj+i follows edge Si according to the the graph of Figure 3. VII. This means 
that the sequences s = (s„)„gz are elements is a topological Markov shift. 

Once again sets Ri have an obvious product structure. For p G i?i we call 
the segment hi{p) specified by intersection of Ri and the line through p in the 
expanding direction the horizontal through p. Similarly, we refer to Ui(p) given by 
the intersection of Ri and the line through p in the contraction direction as the 
vertical. Each rectangle is homeomorphic to the Cartesian product of any one of 
its horizontals with any one of its verticals. Just like for topological Markov shifts, 
the toral automorphism respects this structure: namely, for p e i?i fl (p~^Rj the 
following holds: 

0Wi(p) C 
(t)hi{p) D hj{(j)p). 

We shall incorporate we have just described in a comprehensive theory. 
4. More on Abstract Dynamical Systems. 

Definition 4.1. A dynamical system {X, </>) is said to be irreducible if for every 
pair of open sets U, V there exists n > such that (f)"U f\V 

Another concept we need is the following 

Definition 4.2. A point p is said to be bilaterally transitive if the forward orbit 
{(/)"p| n > 0} and the backward orbit {0"p| n < 0} are both dense in X. 

Remark.. A symbolic sequence in a topological Markov shift is bilaterally tran- 
sitive if every admissible block appears in both directions and infinitely often. 

We use the notation BLT{A) to denote the subset of bilaterally transitive points 
in ^ C X. 

In an irreducible system the bilaterally transitive points turn out to be every- 
where dense. To prove this, we recall the following theorem of point set topology. 
The theorem is more general, but can be slightly simplified in the case where the 

space AT is a compact metric space. 

Baire Category Theorem 4.3. Let {[/„} be a countable collection of open 
dense subsets of X. Then Q f7„ is nonempty. In fact Q J7„ is dense in X. Equiva- 
lently, a compact metric space in not the union of a countable collection of nowhere 

dense sets. 

Proof. Choose inductively balls such that i?„ C i?„ C ?7„, and 5„ c -Bn-i- 
The first property is easily achieved in a metric space; the second because Un is 
dense which implies that i?„_i n?7„ is a non-empty open set. The sequence (_B„)„gN 
has the finite intersection property: so by compactness f]Bn is nonempty. But 
f]Bn C f]Bn C n^n- Thus the intersection f]Un is not empty. It is also dense 
which is a consequence of replacing f/„ in the above argument hy UnOB and X by 
B where B is any ball. □ 

Proposition 4.4. // (A, </>) is irreducible, then the set of bilaterally transitive 
points is dense in X. 

Proof. Let {Un} be a countable basis for X. Since X is irreducible, Ufe>o ^''^n, 
as well as Ufe<o'^'^^"' dense in X for each n. The set BLT{X) of bilaterally 
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transitive points can be expressed as 

BLT{X) = fldJ ^ U '^'^")- 

n k>0 k<0 

We apply the Baire Category Theorem to get BLT{X) ^ 0. But BLT{X) contains 
the whole orbit of any of its points and the orbit of any of its points is dense in 
X. □ 

The three examples wc have disciisscd exhibit a certain property which is easily 
verified. It is concerns the divergence of orbits and is defined as follows. 

Definition 4.5. A homeomorphism (p is said to be expansive if there exists a 
real number c > such that if d{(f)"'p, (/>"q) < c for all n e Z then p = q. 

In the theory of Markov partitions this property plays a key role. Representation 
of dynamical systems by shifts of finite type, while possible for certain non-expansive 
systems, really only seems natural for expansive ones. 

Next we formalize a property of mappings between dynamical systems previously 
alluded to in connection with binary expansions. 

Definition 4.6. For two general dynamical systems {X,(j)) and {Y,tp) we call 
the second a factor of the first and the first and extension of the second, if there 
exists a map it of X into Y, which we call a factor map, such that 

(i) ^pTT = Tr(f), 

(ii) TT is continuous, 

(iii) TT is onto. 

Furthermore, we say tt is a finite factor map or that it is bounded-to-one if 

(iv) there is a bound on the number of pre-images; 
and essentially'^ one-to-one if 

(v) every doubly transitive point has a unique pre-image. 
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Y \|/ Y 

Figure 4.1. Commutative diagram illustrating a factor map 



*This term is used because in irreducible systems the non-doubly transitive points are negligible 
in both the sense of category and measure. 



18 



SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 



We remark that a topological conjugacy^ is a finite factor map where the bound 
on the number of pre-images is one in condition 1.2(iv). As we shall see, the 
seemingly slight weakening of the chains of topological conjugacy, which is what 
the definition of an essentially one-to-one finite factor map is meant to do, allows the 
necessary freedom to get symbolic representations for smooth dynamical systems. 

Proposition 4.7. Let tt be a factor map of {X,(j)) and {Y,ip) : i.e. tt satisfies 
properties (i), (ii), and (Hi) of Definition 4-6. If {X,<p) is irreducible, then so is 
(F,V); andY = BLT{Y). 

Proof. Let U,V be non-empty open subsets of Y. By properties (i) and (iii) 
of factor maps, 7r~^C/, 7r~^V are also non-empty and open. Since {X,(j)) is irre- 
ducible,there exists n > such that 0"(7r-^J7) n tt-'^V + 0. By 4.6(i), 

^ 7r[^"(7r-ic/) n TT-iy] = 7r[7r-i(0"[/ n y)] = n 



Thus iy,^)) is irreducible and from Proposition 4.4 follows that Y = BLT{Y). □ 

Proposition 4.8. Let {X,(j)) be irreducible and ti an essentially one-to-one fac- 
tor map of{X^(j)) onto {Y,tl)): i.e. tt satisfies (i) , (ii) , (iii) , and (v) of Definition 4- 6. 
Then tt maps BLT{U) homeomorphically onto BLT{tt{U)) for any open subset U 
ofX. 

Proof. From the properties of tt, if the forward orbit of x hits every nonempty 
open subset of X, then the forward orbit of 7r(a;) hits every nonempty open subset 
of y. Thus ttBLT{U) C BLT{tt{U)). 

We have that tt is a continuous one-to-one map of BLT{U) into BLT{tt{U)). We 
prove next that its inverse is continuous also. The proof is a standard compactness 
argument which goes as follows. Suppose y„ y, where yn,y e BLT{tt{U)). We 
shall prove that TT~^yn TT~^y. By compactness the sequence (7r~^i/„)„gN has limit 
points in X. Let x be any one of these limit points. By continuity ttx = y. But the 
pre-image of y is unique: so the sequence (7r^^j/„)„gN, having only one limit point, 
has a limit which is TT^^y. 

Now let X e tt^^BLT{tt{U)) C U and let V be any non-empty open subset of X. 
Choose V e BLT{V). Then by what we have already shown tt{v) G BLT{tt{V)). 
Then there exists a sequence of positive integers kn such that ip'''^'7T{x) — > tt{v). 
Thus 

(a;) tt~^iIj'''^tt{x) tt~^tt{v) = v. 

Thus X e BLT{U). We have therefore established tt-^ BLT{tt{U)) C BLT{U) : in 
other words, BLT{tt{U)) C ttBLT{U). □ 

Proposition 4.9. Under the hypothesis of Proposition 4-8, if U is an open 
subset of X, then BLT{ttU) = BLT[ttU]° and ttU = [ttU]°. 

Proof. Let y £ BLT{ttU). Then the unique pre-image of y Hes in BLT{U), and 
Try is not therefore in the closed set tt{X — U). Hence, Try S F — tt{X — U)c [ttC/]". 
Therefore, BLT{ttU) = BLTIttU]". 

^Ther term derives from the group theory notion of conjugate elements and its usage is standard 
in the subject. In the sense we are using it, better would have been homomorphism for factor 
map and isomorphism for topological conjugacy. These are the terms which denote the property 
of preserving structure. 
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From the continuity properties of tt, Proposition 4.8, and what was just proven, 
we get the following string of equalities: ^{U) = i:{BLT{U)) = BLT{wU) = 
BLT[ttU]° = □ 

Exercises. 

4.1 We call a directed graph G irreducible if given any pair of nodes i,j there 
is a directed path from i to j. Show that if G is irreducible as a graph, 
then the dynamical system (Sccr) is irreducible. Conversely, show that 
if the dynamical system (Sg,(t) is irreducible then there is an irreducible 
sub-graph G' such that Eg = Eq'. 

4.2 Given any topological Markov shift system (Eg) there exists irreducible 
sub-graphs Gi, . . . , Gm such that 

Eg = Egi U • • • U Eg„ (disjoint). 
5. Topological Partitions. 

Definition 5.1. We call a finite family of sets TZ = {Ro,Ri, . . ■ ,Rn-i} a topo- 
logical partition for a compact metric space X if: 

(1) Each Ri is open^ 

(2) R,nR^=^y^j 

(3) X = i?o U i?i U • • • U Rn-1-; 

Remark. For open sets [/, V, ?7 n V 7^ ?7 n 1^ 7^ 0. So for members of a 
topological partition we get the following string of implications: Ri H Rj = =J> 
RinRj =iD^Ri°nRj = ^^Ri"riR] = (D^Ri"r\R]° = 0. Thus Ri°r)R]° = 
for i ^ j. 

Definition 5.2. Given two topological partitions TZ = {Ro,Ri, . . . , Rn-i} and 
S = {So, Si, . . . , Sm-i} we define their common topological refinement 7?, V iS as 

n\/S = {Rir\Sj:RiG TZ, Sj € .S}. 

Proposition 5.3. The common topological refinement of two topological parti- 
tions is a topological partition. 

Proof. Let TZ and S be the two partitions in question. First of all, it is clear 
that the elements of 7^ V <S are disjoint. We show that the closure of elements of 
TZM S cover X. Let p G X. We have that p G Ri for some i. Thus there exists 
a sequence of points p„ G Ri such that (i(p„,p) < 1/n. Since 5 is a topological 
partition, for each n there exists Sj^ G S such that p„ G Sj^ . Since S is finite, there 
exists an index j such that j„ = j for an infinite number of n so that we can assume 
that the p„ were chosen in the first place such that each j„ — j. Since p„ G i?i n Sj 
we can choose a sequence of points qm,n G i?i fl Sj such that d{q„i,n,Vn) < l/rn. 
Thus d(q„,„, p) < 2/n whence q„,„ ^ p as n — > oo. Therefore p £ RiCi Sj. 
□ 



^Previous authors have taJjen these sets to be closed sets with the property that each is the 
closure of its interior. The present variation is slightly more general, just enough to maJje some 
notation and certain arguments simpler. In fact, an important example is presented in section 9 
of a partition whose elements are not the interiors of their closures. 
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Proposition 5.4. For dynamical system (X, 0) with topological partition TZ 
of X, the set (j/^TZ defined by (j/^TZ = . . . , ^"iijv-i} is again a topological 

partition. 

Proof. This is an immediate consequence of the following: (l)the image of a 

union is the union of images for any map; (2) a homcomorphism commutes with 
the operation of taking closures; (3)the image of an intersection is the intersection 
of images for a one-one map. □ 

From Proposition 5.3 and 5.4 we have that for m < n, Vm ~ ^"^TZ V 
(j/n-iq^ V • • • V (jf^TZ is again a topological partition. We shall use the notation 

n-l 

7e(") = \J (f)-''TZ. 

Thus 7^(2) = 7^V0-l7^ = {RiHcp-^Rj : Ri, Rj G TZ}. Observe that (7^(2))(2) = 7^(3), 
or more generally (7e("))(™) = 7e(»+™-i). 

The collection IJ cl/^TZ : n e Z is a collection of open dense sets to which we can 
apply the Baire theorem, but due to its special nature we can achieve a slightly 
stronger result with the same sort of proof. 

Proposition 5.5. Let TZ be a topological partition for dynamical system {X, cj)). 
For every p G X there exists a sequence {Rs,.)ke2 of sets in TZ such that p e 

Proof. Since Vm 'P'''^' m < n, is a topological partition, there is a set in it 
whose closure contains p, say flm '/*~'^^«f=- next show that in the refinement 
Vm^\ 0*^7?. the elements of the form nm^\ ^~^^k where tk = Sk ior m < k < n 
comprise a subfamily which is a topological partition of Pl"^ (^~'^i?s^ . Because 
Vm^\ ^'^^ satisfies 5.1(1) and (2), so does any subfamily. Condition 5.1(3) is a 
consequence of 

n+l n 

0<t„_i<Af-l m-1 m 
0<t„+i<N-l 
tk—^k, m<k<.n 

and the fact that the closure of a union is the union of closures. Thus we can choose 
by induction the sets Rg^. as follows. Once having specified sets Rs_^, . . . , Rs^ 
such that p e (yin^ '^^skt can find sets -Rs-„-i and Rg^+i such that p e 
fl-n-i '^ '^^sfc • Hence there exists a sequence {Rsk)ke2. of sets in 1Z such that 

P^nZoH-nr'Rsk- □ 

Remark.. With a slight modification of this proof somewhat more can be es- 
tablished: namely, a finite sequence of sets i?s„, -Rs„+i) • • • ) Rsn can be extended 
to a bi- infinite sequence (Rs^)n&z such that if p e (Xn^~''R'SkJ m < n, then 
P '= Hj^o H-n ^~^Rsk ■ We can even go further and make the same claim about 
extending a one-sided infinite sequence Rs„-^, Rsm+n ■ ■ ■ to a bi-infinite one. 

Definition 5.6. We define the diameter d(TZ) of a partition TZ by 



d(TZ) = max d(Ri) 



where d{Ri) = sup^ ^^g^. d{x,y). 
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Definition 5.7. We call a topological partition a generator for a dynamical 
system (X, 0) if lim„^oo d {\J\ (P^U) = 0. 

If 7?. is a generator then clearly lim„_»oo d (n"„ '^^.Sk) = for any sequence of 
symbols (si)iez € {0, . . . , — 1}^. The converse is also true (see exercise 5.1). In 
addition d ^H^o fl-n ^~''^sk^ = 0. Hence in Proposition 5.5, if 7?. is a generator 

and p G nZo fn^^. then p = f]Zo fl-n 

The following proposition gives sufficient conditions on a topological partition in 

terms of its diameter for it to be a generator. 

Proposition 5.8.. Let {X, </>) be expansive and TZ be a topological partition such 
that d{TZ) < c where c is the expansive constant. Then TZ is a generator; 

Proof. The set H^oo 'P~''^sk contains at most one point and thus has zero 

diameter: for if there exists p, q G Pl^^^oo '^"'^-^^fc ' then d(0"p, 0"q) < c for n G Z 
implying p =ji. Since f]-n<P^'' Rs,, C n"„ 0"''^, (lim„^co fl-n '^"''^sj = 
d (n^oo = 0. From Exercise 5.1 we get that TZ is a generator. □ 

Remarks. Generally we merely have the inclusion relation 



(5.9) nn^"'^«^ ^ n^"'^^'. 

n=0 —n — oo 

but not equality. However, when the sets of the partition are small enough- namely, 
when the hypothesis of Proposition 5.8 is satisfied- we do have equality: that is, if 

nr=o n-n'^-'Re. + then nr=o TtJ^. = n.oo r'-RT,. _ 

Prom the inclusion relation (5.9) we see that if 7r(s) = p, then p G iJa^. Thus if 
X belongs only to then sq = i- In particular, by the remark following Definition 
5.1, if p G i?i or p G i?i , then Sq = i.. In addition, if there exists sequences s,< 
such that 7r(s) = 7r(t) = p and so = i ^ j = to, then p G i?i fl Rj, i ^ j; and 
conversely. In which case p G (i?i — R") fl {Rj — Rj") = dRi fl dRj : i.e. p belongs 
to the boundary of partition elements. 

Let TZ = {Ri, . . . , Rn} be a generator for a dynamical system {X, 0) Let S be 
the subset of the full AT-shift defined by 



(5.10) S={s = (...,s„,...): f]f]cp->'R,,j^^}. 

n— — n 

Because the topological partition 7?, is a generator, the non-empty infinite inter- 
section r\k=-n 'P '^Rsk consists of a single point. Therefore, we can define a 
map TT : E — > X by 

oo 

(5.11) 7r(s) = Pi (/)"i?s_„ n n • • • n 0-"i?s„ . 

Proposition 5.12.. Let the dynamical system {X,(j)) have a topological parti- 
tion TZ which is a generator. Then E as defined by (5.10) is a closed shift-invariant 
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Figure 5.1. Commutative diagram for symbolic representation 

subset o/Sat and the map vr given by (5.11) is a factor map of the dynamical system 
(S, cr) onto {X,(f)) i.e., n satisfies the following items of Definition 4-6: 

(i) (TTT = TT(p, 

(ii) TT is continuous, 

(iii) TT is onto. 

Proof. To prove S is closed we must show that if s = (..., Sfe, ...) e S then 



(5.13) 



n=0 —n 



which then implies that s G S. For each n > 1 there is a sequence t = {. . . ,tk, . ■ .) € 
S such that d{t, s) < 1/n. This means that tk = Sk, —n < k <n so that 



—n —n 

Because t gT,, this set is non-empty. Since this is so for arbitrary n and these sets 
form a decreasing sequence of non-empty closed sets, applying compactness we get 
5.13. 

To prove S is cr— invariant we must show that if s € S then crs e S : in other 
words, for n > 0, if f]\ (l)-''Rsk ^ then n"„ (l>~''Rsk+i ¥= 0- This follows from 
using the distributive property of (f)~^ with respect to intersections and reindexing: 
i.e. 

Cn \ n+1 n— 1 

— n / —n+1 —n+1 

We now turn our attention to the properties of tt. 

(i) TT satisfies ttct = (jjir. This follows from reindexing after applying the property 
that a homeomorphism commutes with the closure operation and preserves inter- 



SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 



23 



sections: to wit, 



n=0 —n 



n=0 —n 



oo n— 1 

n n 

n=0 —n—1 



(^wj TT is continuous. Prom the generating property of TZ, given e > there is 
a positive integer n such that ^(0-71 ^"'^-^si.) < ^- Thus, for s,t G Eg, there is a 
d > 0, namely S = l/(n+ 1), such that if d{s, t) < 6 then 7r(s), n{t) e H-n 't'~^Rsu- 

(Hi) TT 15 onto. This follows immediately from Proposition 5.5. 

□ 

Exercises. 

5.1 Let 7?. be a topological partition for a dynamical system (X, </)). Prove 
that if Hm„^oo d (H-n 't'~^Rsk) — fo'' ^^ly sequence of symbols (si)iez G 
{0, . . . , - 1}^ then lim„^oo d (V"„ </''^7^) = 0. 



6. Markov partitions and symbolic extensions. 

Definition 6.1. We say that a topological partition TZ for a dynamical system 
{X, (/>) satisfies the n-fold intersection property for a positive integer n > 3, if Rs^ fl 
7^0, l<fc<n-l=> flLi -R^fc 7^ 0- Furthermore, we call a 
topological partition Markov if it satisfies the n-fold intersection property for all 
n > 3. 

Remark.. In section 3.1 and before the term "Markov" topological generator 
was defined, we considered the partition C = {Ca ■ a G A} consisting of the 
elementary cylinder sets Ca = {s G T,q : sq = a} for a dynamical system (Sg,(j) 
where Eg is a shift of finite type base on an alphabet A. As one might have guessed 
this partition is the prototype of a topological Markov generator. 

Proposition 6.2. If TZ is a Markov partition, then so is 0*^'^ for any m < 

n. 

Proof. We leave the proof as an exercise. □ 

If a topological partition TZ satisfies the n-fold intersection property, then it 
satisfies fc-fold ones for all smaller k. To increase the order we shall utilize the 
following. 
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Bootstrap Lemma 6.3. IfTZ satAsfies the 3-fold and TZ'-'^^ satisfies the n-fold 
intersection properties, n > 3, thenTZ satisfies the {n+l)-fold intersection property. 

Proof. Suppose Ri^ fl (t)~^Ri,^^^ 0, 1 <k <n. Because H. satisfies the 3-fold 
intersection property, we have 

Ri, n r^Ri,^, n 0-'%^, 9^ 0, 1 < fc < n - 1. 

In other words, 

n r'^i.+i) n n r'^i.+J ^ 0, i < fc < n - i. 

Because TiP''' satisfies the n-fold intersection property, we obtain 

n+l n 

fl 4>-^Ri, = f|<^-'=(i?,, n ^ 0- 

1 1 

□ 

Suppose a dynamical system {X. (p) has a Markov generator TZ = {Ro^ .... i?,v-i}.| 
We define an associated topological Markov shift given by the directed graph G 
whose vertices are labelled by ^ = {0, 1, . . . , — 1} and in which the i-the vertex 
is connected to the j-th, i j, iff i?j fl (t)~^Rj ^ 0. So by definition of the Markov 
shift associated with a transition matrix of a directed graph, 

(6.4) Eg = {s = (s„)„ez : Rsn-i n ^~^Rs^ ^ 0, s„ S >l, n e Z}. 

This set coincides with the subsystem defined by 5.10 which is easily seen as follows. 
On one hand, for s G Eg, each of the closed sets 



{ fl ^-''Rs, \ n = l,2...} 

k— — n 

for any n > is nonempty since the finite intersection under the closure sign is 
nonempty due to the Markov property. For increasing n these closed intersec- 
tions form a decreasing sequence of non-empty sets; and therefore by compactness 

r]Zo(X=-„ Rs, ^ 0. On the other hand, if fl^ oHLZ^^^^ 0, then 
each finite intersection under the closure sign is non-empty which in turn implies 
that each pair of intersections Rs^. fl ^~^Rs^ 7^ 0, for arbitrary k G Z. 

Main Theorem. 

Theorem 6.5. Suppose the dynamical system {X,<j)) is expansive and has a 
Markov generator TZ = {Rq, ■ ■ ■ ,Rn-i}- Then the map tt, as defined by (5.11), is 
an essentially one-to-one finite factor map of the shift of finite type Tiq, as defined 
by (6.1), onto X. Furthermore, If{X,^) is irreducible, then so is (Eg,(t). 

Proof. We must establish (i) - (v) in Definition 4.6. That tt is a factor map- 
namely, it satisfies items (i), (ii), and (iii)-is the content of theorem 5.12. 

In order to establish (iv)-nam.ely, a bound on the number of pre- images under 
TT-we introduce the following concept. 
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m-1 




m 



= t. 



m 



Figure 6.1. A diamond 



Definition 6.6. A map tt from to X said to have a diam,ond if there are 
two sequences s,t G for which 7r(s) = 7r(t) and for which there exist indices 
k < I < m such that Sk = tk, si ^ ti, Sm = tm- 



Lemma 6.7. // the number of pre-images of a point is more than N'^, then n 
has a diamond. 

Proof. We apply the familiar "pigeon hole" argument. Let s^^\ . . .5^^^+^^ be 
+ 1 different sequences which map to the same point. Since the sequences are dis- 
tinct, there are a pair of indices fc, m such that the allowable blocks s|^''^j , . . . s^^^'j'"'^'' 
are distinct. There arc A^ distinct choices of pairs of symbols (s^^^Sm) : so by 
the "pigeon hole principle" there must be two allowable blocks S[fc f„]' '^[fe m]' 
that (Sfe\sm) = iSk\sl^!)- But, since the blocks are different, there is an index 
I such that s'f^ ^ sp"*. Thus the two sequences s*-*-*, .s'--'-' map to the same point, 
agree at indices k, m, but differ at I which is between k, m, which means there is a 
diamond. □ 

Lemma 6.8. // there exists a bilaterally transitive point with two pre-images, 
then IT has a diamond. 

Proof. Let a BLT p have two pre-images. As we have indicated in the remark 
following 5.8, there are two sets Ra, Rb & 'R-,a ^ b, such that p G RaHRb- For each 
n > 0, the family of sets 



{(^"'Rs-n n-'-nRsoD... (t)~'^Rsn : s e Eg where sq = a} 



covers Ra, and the family 



{(/>"-Rt_„ n •• • ni?to n . . .^-"i?t„ : i G where to = b} 

covers Rb- Thus, by compactness, there exists s,t G Eg with sq = a,to = b such 
that 



oo n 

PG fl f]<p-^Rs, 

n=0 —n 
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and 

oo n 

pef]f]cf>-''Rt, 

n—O —n 

Since p is bilaterally transitive and Rq is open, ^"'p G -Ro for some positive n and 
(/i^p e -Ro for some negative m. Thus by the remark following (5.8), Sm = tm = 
0, So = aj^ b = to,Sn =tn =0, which is a diamond for tt. □ 

Lemma 6.9. If d{TZ) < c/2, then n has no diamonds. 

PROOF. Since wa = ^tt, we can asume without loss of generality that fc = 1 in 
the definition of diamond. Assume that p — 7r(s) = Tr{t) where 

s = (. . . , s_2, a,bo,bi, . . . , bm-i, d, Sm+l, ■ ■ ■), 

*=(•••) t-2, a, Co, Ci, . . . , Cm-l, d,tm+l, ■ ■ ■)■ 

We must show that 6; = c; for < Z < m — 1. Because [a, bo,bi,. . . , bm-i, d\ is an 
allowable block in Eg, 

(PRa n Rb, n (p-^Rb, n • • • n </.-"+^E„_i n </.-™iid ^ 0. 

Choose a point q in this open set. Because tt is onto, there is a sequence 

u = {.. ., U-2, a,bo,bi,..., bm-i, d, Um+i, . . . ) e Sg 

such that 7r(w) = q. Also since [a, co, ci, . . . , Cm-i, d] is an allowable block and Eg 
is a shift of finite type, there is a sequence v G Eg such that 

V = {.. ., U-2, a, Co, Ci, . . . , Cm-l,d, Um+1, ■ ■■)■ 

Thus 

r = 7r(u) e (pRa n i?co n <?i-ii?ci n • • • n ^!>-™+ii?™_i n ?!>-™iid. 

From d{Ri) < c/2 and G i?f,, n i?c, for < / < m — 1, we conclude by the 

triangle inequality that d(0'q, (/)'r) < c. Furthermore, (i((/)"q, (/)"r) < c/2 for n < 
and n > m—1. The expansive property then implies that q = r. Thus Rb, n i?ci 7^ 
which implies that Rbi fl Rc, ^ 0. However, elements of TZ are pairwise disjoint: so 

bi = ci. □ 

(iv) There is a bound on the number of pre-images of it. 

(v) A BLT point has a unique pre-image. 

Because 7?. is a generator, n can be chosen so that d (V",i 4'''Ti) < c/2. By 
Proposition 6.2, V"„ (p'^'R- is again a topological Markov partition. For this partition 
the associated shift of finite type of (6.4) is given by the higher edge graph 
Let 7r(2»+i) be the map of EG(2n+i) onto X according to (5.11). It has no diamonds: 
so by Lemma 6.7 a point has at most iV^(^"+^) pre-images, and by Lemma 6.8 a BLT 
point has only one. The original tt satisfies tt = 7r^^"+^)^a-" where ip a conjugacy 
of Eg onto T,Q(2,^+l). Thus we have that under tt a point has at most A''^(^"+^) 
pre-images, and a BLT point has a unique pre-image. 

We defer the proof of irreducibility to Exercise 6.2 □ 

Converse to the Main Theorem. 
Recall that we introduced in 3.1 the partition C ~ {Ci : i ~ 0, . . . , N —1} consisting 
of the elementary cylinder sets Cj = {s G Eg : Sq = i} for a dynamical system 
(Eg, (t) where Eg is a shift of finite type base on an alphabet ^ = {0, 1, . . . , N— 1}. 
This partition is a topological Markov generator. 
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Theorem 6.10. Let {X^cj)) he a dynamical system, (Eccr) an irreducible shift 
of finite type based on N symbols; and suppose there exists an essentially one-to-one 
factor map n from Sg to X. Then the partition TZ defined by TZ = {Ri = 7r(Ci)° : 
i = 0, . . . , N — 1} is a topological Markov generator. 

Remark. Note we assume tt is a factor map which has a unique inverse for each 
bilaterally transitive point, but no bound is assumed on the number of pre-images 
of arbitrary points: i.e., w satisfies (i),(ii),(iii), and (v) of Definition 4.6 but not 
(iv). However, in Corollary 6.12 we shall show that (iv) follows from the others 
under the hypothesis of expansivity. However, as Exercise 6.3 shows property (v) 
is essential: we cannot obtain it from expansivity and (i) through (iv). 

Proof. 

We must prove the following items 

(1) Elements of TZ are disjoint: 

(2) The closure of elements of TZ cover X. 

(3) TZ is a generator. 

(4) TZ satisfies the Markov property. 

(1) Elements of TZ are disjoint: i.e., RiDRj = 0,i ^ j. 

The idea of the proof is to use bilaterally transitive points to overcome a difficulty: 

namely, maps in general do not enjoy the property that the image of an intersection 
is equal to the intersection of images, but one-to-one maps do. Suppose Ri f] Rj ^ 
for i ^ j. Then, by Proposition 4.7 BLT{Ri n Rj) ^ 0. By Propositions 4.8 
and 4.9, tt"-*^ maps BLT{Ri) and BLT{Rj) homeomorphically onto BLT{Ci) and 
BLT{Cj) respectively. Therefore tt^i maps BLT{R,nRj) = B LT (R,) f] B LT (Rj) 
homeomorphically onto BLT{Ci n Cj) = BLT{Ci) n BLT{Cj), which impUes that 
^ BLT{C^ n Cj) C Cj n Cj, a contradiction. 

(2) X = u£o'^- _ 

X = n{T,G) = TT U^o^ Ci = uf^^Tr{Ci) = U^^^Ri, the last inequality following 

from Proposition 4.9. 

For the next two items we need a lemma. 

Lemma 6.11. Under the hypothesis of 6.10, 7r(n^ o--'=C^ J = CCA~^^k for 
m <n. 

Proof. Once again we use the bilaterally transitive points to deal with images 
of intersections. We have the following string of equalitic^s. 



^if] ) = ^ ^^^(fl '^"'C'.. ) = 7^(0 BLT{a-^Cs,)) 

m \ m / m 

which by injectivity of tt and shift-invariance of bilateral transitive points 



= f]n{BLT{a-''Cs,)) =f]na-'^BLT{{Cs,)) 

m m 

which by commutativity of tt and Proposition4.8 



= {^4>-''ABLT{C,,)) = fl<^-'=(5LT7r(C,J) 
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which by Proposition 4.9 



f|0-'=(BLT(i?,J) = BLTif]ct>-HRs,)) 



Q.E.D. 

(3) TZ is a generator. 

Because C is a generator, d (D-n (^~''Csk) ~^ 0- By Lemma 6.11 , 7r(n"„ a~'^Cs^.) =| 
{X^n ^~''Rsk- So, by continuity of tt, we get 



d |^Q<^-'=E,, j = d I^Qr'i?.. j ^ 0. 

(4) TZ satisfies the Markov property. 

Suppose n (/)~-^i?si+i 9^ 0, 1 < fc < n — 1. By Lemma 6.11 we have 7r[Cs. n 
cr-iC,,^+i] = Rs, n 4>-^Rs,+^ 7^ 0, 1 < /c < n - 1. Thus C^. n a-'^Cs,+, 7^ 0, 1 < fc < 
n — 1. Since C satisfies the Markov property, nfe=i ^~''^sk for ah n > 1. So 

T^dlLi '^"''^aJ = fCF^ 7^ for all n > 1. Therefore, flLi 5^ » 

for all n > 1. □ 

Corollary 6.12. If in addition to the hypotheses of Theorem 4-18 the dynam- 
ical system {X, (j)) is expansive, then tt is finite. 

Proof. We derive 4.6(iv) from from the assumption that the domain of tt is 
irreducible, tt satisfies 4.6(i), (ii), (iii), and (v), and </> is expansive. This is an 
immediate consequence of Theorems 6.10 and 6.5. □ 

Exercises. 

6.1 A topological partition TZ for a dynamical system (X, 0) is Markov if and 
only if 

n n 

fi 7^ 0, n r'iJa, 7^ =^ n 5^0' 

k—O k=—n k——n 

for n > 0. 

6.2 Under the hypothesis of Theorem 6.5 show that if (X, (j)) is irreducible, then 

so is (Eg, (t). 

6.3 Show that in Theorem 6.10, condition (v) cannot be replaced by condition 
(iv). Hint consider the following example. Let 




and TT : Xx — > X2 be defined by 



7r[l,l]=7r[2,3]=7r[3,2]=l 
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1 



Figure 6.11. tt indicated by edge-labels 

7r[l,2] = 7r[2, 1] = 7r[3, 1] =0. 

Show disjointness is violated in the image partition of the elementary 

cylinder sets. 

6.3 Let TT be a factor map from a topological Markov shift (Ea, <j) to an expan- 
sive dynamical system {X, (j)) with expansive constant c and let the partition 
7^ defined by 7^ = {R^ = 7r(C0° : i = 0, . . . , iV - 1} satisfy fi(7^) < c/2. 
Show if TT has a diamond, then there exists a point p e X with a continuum 
of pre-images. 



7. Product Structure. 

The Markov property for a topological partition is an infinite set of conditions. 
It is the crucial one for obtaining a topological Markov shift representation of a 
dynamical system, but it could be difficult to verify. However there is another more 
useful criterion for getting it to which we now now turn our attention. It involves 
exchanging one set infinite set of conditions for another of a different sort which are 
more readily checkable. Once more we looked to our concrete systems as as guide. 
The sets of the partitions in examples 3.1 and 3.3 have a product structure whose 
behavior with respect to action of a mapping is intimately tied up with the Markov 
property. 

A general notion of partition without regard to any other consideration is the 
following. 

Definition 7.1. A partition of a set R is defined to be a family H = {ft.(p) : 
p e i?} of subsets of R such that for p, q e 

(1) P G Kp) 

(2) /i(p)n/i(q)^0^Mp) = Mq)- 

Definition 7.2. We call two partitions 

n = {/i(p) : p e i?}, 

V = {^;(p) -.PGR} 
of R transverse if, /i(p) fl v{q) ^ for every p, q e i?. 
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A set R with two transverse partitions 7i, V, can be viewed as having a product 
structure something Hke that of a rectangle which suggests the following graphic 
terminology: we shall refer to the elements H as horizontals and those of V as 
verticals. When we are dealing with elements i?j of a topological partition TZ = 
{Ri :« = !,..., N}, each of having a pair of transverse partitions Hi, Vi, we refer 
to hi{p), as the horizontal through p in Ri and to Vi{p) as the vertical. 

Next we introduce notions concerned with the behavior of horizontals and ver- 
ticals under the map associated with a dynamical system. We shall stick to the 
convention that under the action of a map verticals seem to contract and horizon- 
tals seem to expand. While we don't insist that the diameters of the images of these 
sets actually increase or decrease, this will generally be the case. In the literature 
one encounters the term stable set for what we call a vertical and unstable set for a 
horizontal. 

Definition 7.3. Suppose a dynamical system {X, (j)) has a topological partition 
TZ = {Ri}, each member of which has a pair of transverse partitions. We say 
alignment of verticals and horizontals are respectively maintained by by (j) and (/)~^ 
if for all i,j 

(1) p e i?i n 4>~^Rj Rj n (/)Ui(p) c Wj(0p); 

(2) p e i?i n (t>Rj => Rj n (/)"^/ij(p) c hj{(l>-^p). 

We actually require something stronger. 

Definition 7.4. In a dynamical system {X,(j)) we say a topological partition 
TZ = {Ri} has property M if each set Ri has a pair of transverse partitions such 
that alignments of horizontals and verticals are maintained by (j) and its inverse 
respectively in such a manner that the image of any vertical and the pre-imagc of 
any horizontal is contained in a unique element of TZ. In other words 7.3 (1) and 
(2) are replaced by: 

(1) pe RiH 4>-^Rj => 4>vi{p) c vj{4>p), 



(2) p e i?i n (j)Rj 



We remark that with respect to horizontals 7.6(2) can be expressed alternatively 
as follows: 

p € Ri n Rj ^ (j>hi{p) D hj{(j)p). 









p 




Figure 7.1. Property M 



Proposition 7.5. IfTZ has property M, then so does TZ^'^\ 
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Figure 7.11. Violations 

^-ir. 




R; 



property M 



Figure 7. III. Property M on TZ'-'^^ 

Proof. Let Ri n (j)~^Rj be a member of 71^"^^ Since a partition of a set 
induces one on a subset, the horizontals and verticals of Ri induce corresponding 
partitions of Ri fl ^~^Rj : namely, 

(1) n{Ri n = {%(p) = h,{p) n ^-^Rj : p e i?i n (l)-^Rj}, 

(2) v{Ri n (l^-^Rj) = {vij{p) = vi{p) : p e i?i n 

First to verify that this pair of partitions is transverse, we observe that if p, q e 
Ri n (j)-'^Rj then by 7.4(1) 

Vi{q) C (j)-\ji(j)q) C <j)-^Rj. 
From definition (1) we have 

hijiv) n t;ij (q) = hi{p) n (^-^iij n Wi(q) = hi{p) n t;i(q) 7^ 0. 



Second, we show that (j) and its inverse map verticals and horizontals so as to 
satisfy property M. Let p G Rid <f)~^Rj fl (p~^{Rj fl <p~^Rk)- On one hand, it is 
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immediate from definition that 

(l>vij{p) = <^wj(p) c uj((/)p) = %-(^;ip). 

On the other, 
□ 

Corollary 7.6. If TZ has property M then so does 7^*"^ for n = 1, 2, . . 



Proof. Repeated use Proposition 7.5 using the identity (T?.'"')'^' =7^("+i). □ 

Proposition 7.7. For a dynamical system {X,(j)) if a topological partition TZ 
has property M, then TZ satisfies the 3-fold intersection property. 



w 



q <> 



\/\/\ 



Figure 7. IV. 3-fold Intersection Property 

Proof. Let p e (jyRi nRj and q e i?^ n (f>~^Rk ^ 0. Then by transversaUty 
Vj{q)r\hj{p) ^ 0. Futhermore, Vj{q)r\hj{p) C Rj and Vj{q)r\hj{p) C 4>~^Vk{(j)q)r\ 
(phi{(f>~^p) C 0"^i?fc n (l)Ri : So ^Ri n Rj n (l)~^Rk ^ 0. Thus we have 

Ri n (j)-^Rj ^ 0, Rj n (j)~^Rk i^$=^ Rif\ 4>~^Rj n 4)~^Rk. 

□ 

Corollary 7.8. Given a dynamical system (X, 0), if a topological partition 
TZ has property M, then TZ^^^ is satisfies the 3-fold intersection property for n = 
1,2,.... 



Proof. Follows from Corollary 7.6 and Proposition 7.7. □ 
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Theorem 7.9. Given a dynamical system {X,(j)), if a topological partition TZ 

has property M, then TZ is Markov. 

Proof. Follows from Corollaries 7.6, 7.8, and the Bootstrap Lemma 6.3. For 
instance, TZ^"-~^^ and T?.^"^ satisfies the 3-fold intersection property, so TZ^'^~^^ 
satisfies the 4-fold one. Working our way back, we get 7^("-2) satisfies the 5-fold 
one, etc. Finally, we get that TZ satisfies the (n-|-2)— fold intersection property; but 
this is true for any n. □ 

We conclude this section with a theorem which is useful in applications to dynam- 
ical systems having smooth manifolds as phase spaces. For that theorem boundaries 
of partition members will play a role. In addition we shall need more topological 
structure than that provided by mere existence of a pair of transverse partitions. 

First we turn our attention to boundaries. In certain problems the burden of 
establishing the Markov property for a partition via property M can be cased by 
merely verifying a similar property for boundaries. The reader will get a good 
illustration of this when we discuss in detail Markov partitions for automorphisms 
of the two torus. 

Employing the usual notation, we have that the boundary of an element Ri in 
a topological partition TZ is given by dRi = Ri — Ri. We denote the union of all 
boundaries of elements of TZ by dTZ = dRi. Suppose the boundary dRi of each 
element of TZ is the union of two subsets: one, dyRi which we shall call the vertical 
boundary of Ri, the other, OhRi, the horizontal boundary of Ri. We denote the 
union of all vertical boundaries of elements of TZ by dyTZ = [j^ dyRi, and the union 
of all horizontal ones by OhTZ = [J^ duRi ■ 

Definition 7.10. We say that a topological partition TZ has boundaries satis- 
fying property M if the following hold for each i : 

(1) dR^ = dvR,UdHRi, 

(2) Vi{p) ndRi C dnRi, 

(3) hiip)ndRiCdvRi, 

(4) (f>dvTZcdTZ, 

(5) (p-^dnTZ c dTZ. 

We introduce the additional topological structure needed for the next theorem. 

Definiton 7.11. We call a metric space R an abstract rectangle if it is home- 
omorphic to the Cartesian product two metric spaces- i.e. there exist two metric 
spaces H, V and a homeomorphism $ of the Cartesian product H xV onto R. 

Sets with a pair of transverse partitions usually arise in this way. Let $(p^, p^) — 
p, where p e i? and (p^,p^) & H x V. Define the following horizontal and vertical 
sets of R : 

h{p) = ^{ix,p^) : X e H}, 
v{p) = ^{{p\y) -.yeV}. 

Naturally the two partitions 

n = {/i(p) -.PGR}, 



V = Mp) :pGR} 
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Figure 7.V. Abstract rectangle 



of R are transverse since 

«(p)nMq) = {$(p\q')}/0- 

In addition, for each pair of points p, q G i? the map (p, q) v{p) (1 h{q) is 
continuous, onto, and maps /i(p) x v{q) homeomorphically onto Ri. Thus we could 

have assumed that H, V were subsets of R in the first place. Wc use the letters we 
do to designate these subsets in order to suggest horizontal and vertical lines. 

Theorem 7.12. In a dynamical system {X,(j)), if each element of a topological 
partition TZ is a connected abstract rectangle, the alignments of which are main- 
tained by (j) and its inverse respectively, and if TZ has boundaries with property M, 
then TZ itself has property M- i.e. TZ is a Markov partition. 

Proof. 











1 
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Figure 7. VI. Impossible boundary picture 
We give the proof only for verticals which consists in proving 
peRif) (j)~^Rj (l)Vi{p) C Rj. 
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Our proof involves one proof by contradiction established by means of a second. 
The main one is a contradiction to the assumption that (j)Vi {p) ^ Rj . The other one 
contradicts the connectivity of (f)Vi {p) , which is a consequence of the the following. 

Since Ri homeomorphic to hi{p) x Vi{p), the vertical Vi{p) is connected: for 
otherwise Ri would not be. Therefore, the homeomorphic image 4>Vi{p) is connected 
as well. Thus, if (j)Vi{p) ^ Rj, then we would have that 4'Vi{p) n dRj ^ 0. Hence 
there would exist a point q € (j)Vi{p) n dRj which is also in <j)Vi{p) n Rj : for if not, 
then there would be an open set U D 4>Vi{p) n dRj such that U n 4>Vi{p) H Rj = 0, 
and the open sets Rj and U U {X — Rj) would disconnect 4>Vi{p). 

By preservation of alignments, we have 4>Vi{p) fl Rj C Vj{4>p) so that q G Vj{(jyp). 
Thus q E ShTI from which follows by property M for boundaries that 4>~^{q) € dTZ. 
However, (l>~^{q) G Vi{p) C Ri, which contradicts Ri fl dRi = 0. 
□ 

We now turn our attention to 2-dimensional toral automorphisms in generality. 
While there exists non-measurable automorphisms, for us toral automorphisms will 

mean continuous ones. 

8. Markov Partitions for Automorphisms of the 2-torus. 

Let X = IR"/Z" be the n-dimensional torus and A a n x n matrix with integer 
entries and determinant ±1. Such a matrix defines an automorphism of the n-torus 
in the manner described in Section 2.3. The set of such matrices forms a group 
called the general linear group GL{n,Z). Both a matrix A and the automorphism 
(j) it defines are called hyperbolic, if A has no eigenvalue of modulus one. 

We shall devote the rest of this section to the two dimensional case; i.e. n = 2. 
Let 

a b 



Eigenvalues of A are are the solutions of the quadratic equation 

x"^ - (traceA)a; + detA = 0. 

Here hyperbolicity mc^ans that A has two distinct eigenvalues, say A and fi which 
are irrational numbers. Since A/U = det^l = ±1, we can assume that |A| > 1 and 
< 1. An easy calculation shows that the row vectors 

VA = (c, A - a) 
= (c, /i - a) 

are eigenvectors associated with A and with /i respectively. The action of A on 
a vector v is to contrac;t its v^,-component by fi and expand its VA-component by 
A. Directions may or not be reversed depending on the signs of the eigenvalues. 
We refer to the direction of va as the expanding direction and that of v^ as the 
contracting one. Finally let £\ be a line through the origin in the expanding direction 
and £n the one in the contracting direction. We call these lines, which are invariant 
under the action of A on the plane, the expanding and contracting eigen-line 
respectively. The slopes of these lines arc mx = {\ — a)/c and = (/j, — a)/c. 
From these formulae one sees that these lines pass through no lattice points other 
than the origin: for if they did then the slopes m\ and would be rational 
numbers and so would A and ^,. 
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Theorem 8.1. A hyperbolic toral automorphism is expansive. 

Proof. Let <p be an automorphism and p, q G X be any two different points 
of the two dimensional torus. Let c = |/i|/8. We shall show that there exists 
n e Z such that d(0"p, </'"q) > c. By translation invariance of the metric we have 
d{p, q) = d{{0}, p — q) : so it suffices to show that d{{0}, 0"r) > c, for any r 7^ {0} 
in X. 




Figure 8.1. A fundamental region and eigen-directions of A 



We take the torus to be given by the fundamental region 

X = {{x,y):\x\<l/2, |y| < 1/2} 

with the appropriate boundary identifications. In this region the metric on the 
torus coincides with the Euclidean one: namely, (i(p,q) = ||p — q|| where ||(a;,y)|| = 
i/xM-y^. Let r = p — q 7^ (0,0). Let r^ and be the va— component and the 
v^— component of r respectively. Then from the triangle inequality 

|A"r;,|-|/.%|<||</."r||<|A"rA| + |M%|. 

One of the components r^, r^^ is not zero. We can assume that r^ ^ : otherwise 
replace (f> by (^~^ in the argument. We can also assume that < 5 : for, if not, 
replace (j) by 4>'' for large enough fc. If ||r|| > -1^, then ||0"r|| > c for n = 0. If 
llrll < choose n > 1 such that 



n+l 



< \rx\ < 



Then the following inequalities inequality show that 0"r is in the fundamental 
region and that ||0"r|| > c. 



|A"rA| + |/x%|<i + ^ 



n+l I 
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□ 



Remark. We now have all the ingredients for a formal proof that the toral auto- 
morphism of example 3.3 enjoys the conclusions of theorem 6.5 about representing 
dynamical systems by topological Markov shifts. In this case the automorphisms 
be given by the matrix 



and the topological Markov shift (Sccr) by the edge-graph G of figure 3. VIII. 
Theorem 8.1 shows that (j) is expansive, a necessary item in the hypothesis of 
the main theorem, Theorem 6.5. Theorem 7.13 can be applied to show that the 
partition in example 3.3 is Markov, another necessary item. Finally in 3.3 we 
have already shown that the partition in question is a generator, the remaining 
requirement of the main theorem. 

Returning our attentions to the general case, we shall exploit one of the properties 
of a hyperbolic automorphisms: namely, a matrix in the group GL{2, Z) specifying 
a hyperbolic automorphism is conjugate to another one all of whose entries bear 
the same sign. More specifically we have the following theorem of Williams [W] 
proved by entirely elementary methods of plane geometry. 

Theorem 8.2. Let A e GL{2,Z) be hyperbolic. Then there exists C,P e 
GL{2,Z) such that CAC~^ = eP where e = ±1, the choice of sign being the same 
as that of A, and the entries of the matrix 



are non-negative. 
Proof. 

Choose a pair of lattice points (a, f3) , (7, 6) such that 

(i) The angle {a, /3){0, 0)(7, S) is acute and i\ lies in it but does not. 

(ii) The closed parallelogram with vertices the origin, the two lattice points, 
and the lattice point {a + f3 + S) contains no other lattice point. 

This can always be done. One way is to use continued fractions to approximate 
slopes of lines, a discussion of which shall be deferred to a remark. An even more 
elementary way is the following. 

First, chose initial lattice points (a, /3) and (7, S) so close to ix that (i) is satisfied 
and no other lattice points lie between them and the origin on a direct line. Then 
if the parallelogram in (ii) contains another lattice point in its interior, connect 
it to the origin with a line segment. Form a new pair of lattice points by taking 
the closest lattice point to the origin on this segment and selecting the one from 
the previous pair for which (i) holds. Continue this process until condition (ii) is 
satisfied. 

Condition (ii) is equivalent to the area of the parallelogram equalling 1. 
Consider the linear map given by the matrix 
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Since 

(0,1)C= (a,/3) 
(1,0)C=(7,(5), 

C maps the principal fundamental region-namely, the closed unit square-onto the 
parallelogram. We shall show that C provides the sought after conjugating trans- 
formation. 



(0,0) 




Figure 8. II. Parallelogram and expanding direction 



The two lines JxJ.^ that are the images under of ^a,^/^ are the expanding 
and contracting cigcn-lines for each of the transformations zbCAC""'^. Choose P to 
be the one that preserves the orientation of Jx, the choice of sign being that of A. 
This means that the matrix P has |A| as the expanding eigenvalue and either ±/Lt 
as the contracting. Furthermore, because C""'^ maps the parallelogram onto the 
unit square and ^\ passes through the parallelogram while £^ does not, the line J\ 
passes through the first quadrant and the line the second. 

To prove that P is non-negative, or cquivalcntly, that P maps the first quadrant 
into itself, we must just show (0, 1)P and (1, 0)P lie in the first quadrant. We shall 
give the proof only for (0, 1)P : the arguments which follow work equally well for 
the other lattice point (1,0)P. 

There are two cases depending whether the contracting eigenvalue /U of P is 
positive or negative. In the second case (/x < 0), the linear map P reflects the first 
quadrant about the eigen-line J\, while in the first case [fi > 0) no reflection takes 
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place. Let c denote the origin, a the lattice point (0, 1), and a' its image under P. 
Let b be the projection of a on the line Ix in the direction parallel to and b' its 
image under P. The point b' is also the projection of a' on the line Ix. 




Figure S.IIL Geometrical figure for proof without reflection 



We deal first with the case without reflection. Let £x,^ij, be the lines through a 
parallel to £x,£fi respectively. The notation |pq| stands for the length of the line 
segment with end-points p,q. On one hand, since \a'b'\ — • |ab| < |ab|, the 

point a' lies between the lines ?;j,?a- On the other, since |cb'| = |A| . |cb| > |cb|, the 
point a' lies to the left of J^. The region bounded by these three lines, in which a' 
thus lies, is contained in the first quadrant. 




+ 



Figure 8. IV. Geometrical figure for proof with reflection 
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For the case with reflection, let c' be the intersection of the hne l\ and the 
vertical through a'. Suppose that a' belongs to the fourth quadrant but not the 
first. The point a' being a lattice point implies that |a'c'| > 1. However, because 
triangle c'a'b' is similar to cab, |a'c'| = . |ac| < |ac| < 1, a contradiction. 

□ 

Remark. By means of continued fractions we can somewhat augment the con- 
clusion of Theorem 8.2: namely, we can conjugate so that the following two condi- 
tions to hold simultaneously. 

(1) < -1, 

(2) < mA < 1, 

where rn^ is the slope of the contracting eigenline and rn\ is the slope of the 
expanding one I\ for P. 

The first inequality indicates that the contracting eigen-line for P passes through 
the second quadrant between the lattice points (0, 1) and (—1, 1); and the second 
that the expanding one passes through the first quadrant under the lattice point 
(1,1). For Theorem 8.4, the main one of this section, one does not need more 
than what is provided by Theorem 8.2. These extra properties make life a little 
less difficult. The first one makes Figure 8. VI easier to draw. The second one 
obviates repeating proofs covering slightly different geometrical figures. Not taking 
advantage of it multiplies the number of cases in the proof, and we shall have 
enough of them as it is. Since we shall not be using the full strength of this remark 
one can skip the remainder of it and proceed to directly Theorem 8.4. We shall be 
using the second property which is very easy to achieve by itself. 

Prom the theory of continued fractions, we know that every irrational number 
can be written uniquely as an infinite continued fraction [oq, ai, . . . ] = oq + l/(ai + 
1/ . . . ) where a„ G Z for all n and a„ > for n > 0. In addition, the continued 
fraction of a quadratic surd has a periodic tail: namely the tail can be written as 
[6i, . . . , bm] > 1, where the overbar means infinite repetition of bi through bm- In 
[ATW] the following was proved. 

Theorem 8.3. Let A e GI/(2,Z) be hyperbolic. The slope mx, being a qua- 
dratic surd, can be written mx = [ao, ai, . . . , a„, 6i, . . . , bm], where m is as small as 
possible. If 



C = 

then CAC~^ = eP where 



1 \ /O 1 \ (0 1 

1 ao / V 1 ai j ■ ■ ■ V 1 a„ 



1 \ / 1 



N 



for some positive integer N and e is the same as in Theorem 8.2. Furthermore, the 
slopes the eigen-lines o/P satisfy mx = . . . , bm] > 1 and—l/rfi^ = [bm, ■ ■ ■ ,bi] >| 
1. 



The matrix 



l\p/0 1 

1 [l 



achieves the above two conditions in the remark. 
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Theorem 8.4. Let (j) be a toral automorphism whose defining matrix is either 
P or — P where 




is a hyperbolic matrix in GL{2,Z), with non-negative entries. Then there exists 
a Markov generator TZ* for (j), the members of which are of parallelograms. The 
associated Markov shift is given by a directed graph also specified by P : i.e., the 
edge graph with connections given by P consists of two vertices labelled I and II 
with p directed edges from I to itself, q from I to II, r from II to I, and s from II 
to itself. 



q paths 




r paths 



Figure 8.V. Edge graph defined by P 



Proof. We shall assume that the expanding eigen-line of P as given by Theorem 
8.2 passes under the point (1,1); if not, conjugate P by the matrix 

-a 

which reflects the first quadrant about the line y = x. 

Before proceeding in earnest, we need some notation. Dropping the bars, we 
now let va, be the expanding and contracting eigen- vectors of P and l\, 
the corresponding eigenlines through the origin. We denote lines parallel to these 
through a point p by assigning p as a superscript. For example, l^^'^^ denotes the 
line through (0, 1) parallel to etc. 
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We define the following points as depicted in Figure 8. VI: 

o=(0,0) 

o' = (l,0) 
o"^(l,l) 
o"'^(0,l) 

a = £^ n £a 



(8.5) 



a' 



a" 



^(1,0)^^(1.0) 

^(M)n4M) 
a'" = if^^ n 4°''^ 

W = efi^^'°'> nix = 6 + (1,0) 
b" = £(.i'i)n4°''^ =b + (i,i) 



c' 



^(1,0)^^(1,1) 

c^^(o,-i)n£, 

d = £^n4"'''^ 

d' = ^(,i'0)n4°''^ = d+(i,o) 

d* = d'- (0,1). 

We have drawn Figure 8. VI as if the first statement in the remark following 
Theorem 8.2 holds. This places the point c' in the unit square. Since we are not 
using this condition, c' could appear anywhere to the left of the line a; = 1 in the 
strip between the lines y = and y = 1. 

Let Ri be the interior of parallelogram acd'b' and Ru the interior of parallelo- 
gram c'd'b"a". 

The closed set Rj U Ru, as we shall show, is a fundamental region, which we 
shall call the principal Markov one. As drawn in Figure 8. VI, this set is equivalent 
modulo to the unit square by sliding Aaa"'c one unit to the right and Ad'b"a" 
one unit down. But in general we need a slightly more elaborate proof. 

First, no two points in the interior of RjURu are equivalent because RiDRu is 
disjoint from its four neighboring unit translates which totally bound it. Second, the 
set of all Z^— translates of Rj U Ru covers the plane because all integral horizontal 
translates of the union of this set with its unit downward vertical translate covers the 
infinite strip between the lines y = and y = 1 and all integral vertical translates 
of the strip covers the plane. 

Thus we can view the torus to be the set X = Rj U Rjj with points on the 
boundary identified by lattice translations. 

The proof of the; theorem involves four cases: two subcases arise for each matrix 
±P depending whether /x is positive or negative. These amount to the four possible 
combinations, 

(A,/x) = (±|A|,±|/x|), 
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/ (1,1) 



(-1,1) 



/(-1,0) +' 




c \ 



.1 
d 

1 1 1 - ■ 


\ 

\ 






Ri 




d* 


^^^^^^ \ 

— \ 

\ 


\ a =0 
\ 

\ 

\ 

\ 

\ 

\ 



7 (0,1) 



(1,1) 



a = o' 



/ (0,0) 



•N a' = o' 
\ 

\ 



7(1,0) 

V 



(0,-1) 



1^ 



(0,0) 



1^ 



Figure 8. VI. A remarkable fundamental region 

of signs for the eigenvalues of the matrix representing </>. The simplest case is when 
both A and /U are positive. Things are more difficult when either is negative, espe- 
cially [i. So that the proof appears less tedious, we divide that for each case into 
five steps. 

Case I: A > 0, ^ > 0. 

Step 1. The family TZ = {Ri,Rii} is a Markov partition. 

Our aim is to show that TZ satisfies the hypothesis of Theorem 7.12. 

By virtue of their construction as parallelograms the members i?/, Rjj of TZ are 
connected. These open parallelograms are obviously abstract rectangles, each being 
homeomorphic to the Cartesian product of two of open intervals. The two in ques- 
tion arc a pair intersecting sides of a parallelogram minus endpoints. Horizontals 
/, //, are given by 



and verticals oi Ri,i 



h,{p) = (lr\R,. 

Since P defines a linear transformation of the plane, the image of a line parallel 
to an eigen-line is another line parallel to the same eigen-line: so alignment of 
verticals and horizontals is maintained by (j) and (f)~^ : namely, 



(1) 
(2) 



p€ RiHcf) ^Rj 
p e ii, n (fyRj 



RjDcjwiip) C Vjicjyp); 
Rjr\(t)-^hi{p) C hj{4>-^p) 
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for i,j ^ I, II. 

Next we verify that the boundaries of these of members satisiy property M of 
Definition 7.10 which entails five items. The first of these concerns dividing dRj 
and dRii into vertical and horizontal pieces. As shown in Figure 8. VI we have 

dRj = dvRilJdHRi, 

dRjj = dyRji U duRii-, 

where 

dvRi = ac U b'd', 

dnRi = ab' U cd', 
dvRii = c'd' U a"b", 
dHRii = c'a" U d'b", 



each being the union of two line segments. Utilizing the boundary identifications, 
we have that dTZ consists of two transverse line segments intersecting at the origin: 
namely, 

dHn = cb' c£i = ex, 

8.6 

^ ' dvTl = ad C (f,. 

It is clear from Figure 8. VI that (2) and (3) of 7.10 are satisfied: namely, 



Vi{p)r\dRiCdHRi, 



hi{p)ndRi C dvRi, 

From (8.6) and the property that the restrictions of P and respectively to the 
lines tx and are contractions we get 

cb' C (cb')P 

(8.7) 

^ ^ (ad)P C ad. 

We can restate (8.7) as items (4) and (5) of 7.10: namely, 

4)dvn C dUv C dn, 

(t>~^dHii c OhH c dn. 

Thus wc have established that TZ satisfies the hypothesis of theorem 7.12: so it 
is Markov partition. 

Remark. At this point it may be instructive to remark on our definition of 
topological partition. This example illustrates the advantage of using open sets 
over their closures as members of such a partition. For one thing, Rj ^ Rj and 
Rii ^ Rii . For another, while Rj and Ru are abstract rectangles, their closures 
are not. For instance, Rj is homeomorphic to a Cartesian product: namely, the 
product of the line segment ab' minus the end-points with the segment ac minus the 
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end-points. However, Rj is not homcomorphic to the Cartesian product of ab' with 
ac, since part of the segment ab' is in the boundary and part isn't. Furthermore, 
dvTZ and dn'R- are connected Hne segments which get mapped into themselves 
under (j) and respectively. This would not be the case if 7?. = {Ri,Rii}. If 
one's definition of topological partition involves closures of open sets as members 
rather than open sets, then one is forced into somewhat greater contortions in order 
to achieve the same results. 

The Markov partition TZ is not necessarily a generator. The trouble is that, while 
the members of TZ are connected, those of TZ^^^ =TZ\I <j)~^TZ may not be in which 
case nonempty sets of the form H-n 'P^'^^s^ may consist of more than one 

point. To overcome this let us examine the sets i?,; n (t>^^Rj E TZV 4)^^TZ. As we 
shall see each consists of a union of disjoint open parallelograms, the number of 
which is given by the matrix P. A remedy is immediately suggested. 

Step 2. The family TZ* = {Rl : 1 < k < N*} consisting of all connected 
components of the sets Rir\<j)~^Rj £ TZ\/ (t)~^TZ is Markov partition. 

Once again we must show that the members of TZ* satisfy the five items of 
Definition 7.10. 

In the universal cover the image (i?,;)? is an open parallelogram that has been 
stretched by a factor of A in the UA-direction and shrunk by a factor /x in the 
w^-direction. This parallelogram passes through various Markov fundamental re- 
gions; and in each one it when it intersects a parallelogram equivalent to Rj, the 
intersection is a parallelogram. No two of these intersections share an equivalent 
point: for otherwise a violation of (2.1.1(3)) with respect to the fundamental region 
{Ri U Rii)V would be committed. While the set (phi{p) n Rj may consist of several 
horizontals, the set (j)hi{p) D R^. is either empty or a single one. Furthermore, if 
4>hi{p) nRl for one horizontal hi{p) of Ri, then (phi{p') Ci R^. ^ 9 for any other 
horizontal hi{p') in R^. Back on the torus the various non-equivalent parallelograms 
represent disjoint connected sets which we have labelled R^.,! < k < N*. Being 
parallelograms, these sets are abstract rectangles. 

Alignment of verticals and horizontals for these members of TZ* is maintained by 
(p and for the same reason it is for members of TZ. 

Regarding boundaries we have that 

OhTZ^^^ = dnTZ* = (c"b')P. 
dvTZ^'^'^ = dvTZ* = ad. 

Thus, just as for TZ, we have 

(j)dvTV = (ad)P C ad = dTZy C dTZ*, 
(t^-^'dnTZ* = c"b' c (c"b')P = dnTZ* c dTZ*. 

The partition TZ* satisfies the hypothesis of Theorem 7.12: so it is Markov. 
By reasoning as before, a parallelogram of the form {Rl)P passes through various 
fundamental regions; and in each one it when it intersects a parallelogram equivalent 
to i?;*, the intersection is a parallelogram, no two of which intersections share an 
equivalent point. Hence, a nonempty set of the form fii?* is a single connected 
parallelogram. 
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If ii^ is one of the parallelograms in (pRi n Rj , then the length of its WA-dimcnsion 
is the same as that of Rj while the length of its v^-dimemsion is |/Lt| times that of 
Ri. Similarly, a non-empty set of the form (j)R* fl i?* is a single connected open 
parallelogram, the length of its VA-dimension being the same as that of Rj and the 
length of its Vju-dimension fj, times that of R^. 




Figure 8. VII. A nonempty parallelogram of the form f|"^ 4>~''R*sk 



Likewise, a non-empty set of the form f]"^ (j)^''R*^ is a single connected open 
parallelogram, the length of its VA-dimension is times as that of and the 
length of its v^-dimension times that of i?*^. From this we have 

d ^\/ </)'=7^* j =d(7^)/|/x|"+l^0,asr^^oo. 

In other words, TZ* is a generator. 

Step 4. Let ^{(pRi n Rj) denote the number of disjoint parallelograms in the 
intersection (pRi fl Rj . 



#{(l)Ri n Ri) = number of lines x = 0,x = 1, . . . , traversed by {Ri)P; 
H^{(l>Ri ^ Rii) = number of lines y = 1, y = 2, . . . , traversed by {Ri)P, 

which leads to 

ij^{(j)Ri n Ri) = x-coordinate of (0, 1)P = p; 

#{(i)Rii n Ri) = x-coordinate of (1, 1)P - (1, 0)P = q; 
'■^■'^^^ #((/>i?7 n Rii) = y-coordinate of (0, 1)P = r; 

#{(i)Rii n Rii) = y-coordinate of (1, 1)P - (1, 0)P = s. 



To establish Step 4 it is important to imdcrstand how (i?/)P and {RjiP inter- 
sect various Markov fundamental regions particularly how they begin and end. We 
must show that Figure 8. VIII truly represents the situation: namely the segment 
{b'd')P lies within the segment db. First, if a parallelogram (i?i)P intersects a lattice 
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Figure 8. VIII. How {Ri)P and {Rii)P intersect various fundamental 
regions 

translate of Rj then it stripes all the way across. As it does do, it passes strictly 
through the lattice translate without ever straddling any part of a horizontal bound- 
ary: for otherwise there would be a violation of the property that {Rj U Rii)P is a 
fundamental region because dn^KP) 3 dnTi- 

Since P is a contraction on the line we have |(ab)P| < |ab|; and since the 
point a = (0, 0) is fixed, the point (b)P lies inside the segment ab. It then follows 
that the point (b')P = (b)P + (p, q) lies inside a lattice translate of ab-that is, it lies 
on the lino strictly between a = (p, q) = (a')P and b = (p, q) + b. The same 

is true for (d')P : for otherwise {Ri)P would straddle part of its own boundary on 
the line ^(p-I'?). Thus 

(ac)P C (ad)P C ab. 

and 

(b'd')P C ab. 

So the parallelogram (-R/)P begins (as shown in the leftmost figure of Figure 
8. VIII) with its left vertical boundary contained in the segment ab on the line 
and ends (as shown in the middle figure of Figure 8. VIII) with its right vertical 
boundary contained in the segment ab on the line We see that the (Ji/)P 

begins by striping across Rj. Its lower horizontal boundary lies on the line i\ and 
covers the lower horizontal boundary of i?/ . Not shown in the figure is manner by 
which {Ri)P stripes across the top of fundamental region consisting of i?/ + (r, s— 1) 

and Rjj+{r, s—1) with its upper horizontal boundary contained in the line £^-^ = 
^(r,s) rpj^g parallelogram (i?/)P ends by striping through Rjj + {p — l,q — 1). 

The other parallelogram {Rjj)P begins where {Ri)P leaves off. Its left vertical 

boundary is contained in ab, and its right one in ab where a = (p + q + s) and 

b = h + {p,q) as shown in the middle figure of Figure 8. VIII. The first set that 
(i?//)P stripes through is Ri + {p, q) and the last Ru + {p + r — 1, q + s — 1) . It 
stripes through Rj + (p, q) on top of Rj n {Ri)P + (p, q) and ends at the bottom of 
a Rii + (p + r — 1, g + s — 1) as shown in Figure 8. VIII. 

As drawn in Figure 8. VI, the line segment aa'" lies totally in Rj connecting its 
bottom horizontal boundary with its top: so {Ri)P has the property that it stripes 
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across R; + (m. n) if and only if it passes through the interior of the line segment 
aa"'+(m, n). But one must bear in mind that the picture does not represent all cases 
as c' may not be located in the principal unit square so that the line segment aa'" 
may not lie totally in i?/. Nevertheless, because of the way (i?/)P begins and ends 
it obeys this property when (m, n) = (0,0) and (m,n) = {p — Consequently, 
it also obeys this property for at all intermediary lattice points. By intermediary 
we mean at (m, n) where < m < p — 1 and < n < g. Similarly for (i?//)? it 
satisfies the property for (m, n) = {p, q), (m, n) = {p + i — 1, g + s — 1), and thus 
for all intermediary {rn,n). 

The number of times {Ri)P passes through a lattice translate aa'" equals the 
number different lines x = integer crossed by {Ri)P, which equals p for (-R/)P and 
r for (i?77)P. 

Similarly, a parallelogram {Ri)P intersects a lattice translate of Rjj if and only 
if (-Ri)P crosses the same a lattice translate of a"'a"; and the number of times this 
happens equals the number lines y = integer crossed by (-Ri)P, which equals q for 
{Ri)P and s for {Rii)P. 

Remark. A consequence of the fact that right contracting boundary of {Ri)P 
is contained in a lattice translate of ab is that 4>Ri fl Rn ^ which means q > 0. 
But this we aready know from the property that P is hyperbolic. Similarly, r > 0. 
However, either p or s could be 0, but not both. 

Step 5. The transition matrix associated with mapping of the Markov generator 
TZ* by (j) coincides with matrix which specifies the automorphism-namely, P- the 
edge graph of which is illustrated in Figure 8. V. 

Prom what we have established there are N* = p + q + r + s parallelograms R* 
in TZ*. We separate the subscripts into four sets: 



(1) {1,...,P} 

(2) + 

(3) {p + q+l,...,p + q + r} 

(4) {p + q + r+l,...,p + q + r + s}. 



We label the members of TZ* accordingly: 

RlU---URl = RiH (t>Ri, 
U • • • U = Rn n <l>Ri, 
P^+q+i u • • • u -Rp+g+r = -R/ n (/)i?77, 

Rp+q+r+l U • • • U = Rll n (/>i?7/- 

As we have said before: if 4'hi{p) n i?^ 7^ for one horizontal fti(p) of Ri, then 
4>hi{q) n i?^ ^ for any other horizontal /ii(q) of Ri. Therefore, the image of each 
parallelogram ii^ contained in an Ri intersects the same elements of TZ*. From this 
we get that fl i?;* ^ 0, equivalently ii^ fl 7^ 0, whenever either 

e (1) U (3) and le{l)U (2), 



or 



fc e (2) U (4) and / e (3) U (4). 
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Case II: A > 0,/x < 0. 

Except for the lattice points of the unit square labelled by o's we redefine the 
other ones in Figure VI by translating them by pv^ with p, chosen so that 

(ci)P = a. 

For example, anew = add + pv^. Whereas before the point a was the fixed point o, 
now it isn't. Figures 8.IX(A) and 8.IX(B) depict new principal Markov fundamental 
regions in which the old one of Figure VI has been translated by pv^. The difference 
in the two figures is the location of o with respect to b. In a moment we shall argue 
that Figure 8.IX(B) cannot occur. 



(0,1) + P V n 




Figure 8. IX (a). New principal Markov fundamental region, a transla- 
tion of the previous with o between a and b. 



Step 1. The new partition TZ = {Ri, Rii} is Markov. 

The proof of this is the same as in Step 1 of Case I except the verification of 

(pdvU C dTZv C dTZ, 
(t)~^dHTl C OhTI C dll 

is more involved and is as follows. 

For this new partition, (8.6) remains valid: namely, 

OhTI = cb' C II, 
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Figure 8.IX(b). New principal Markov fundamental region, a transla- 
tion of the previous with o above b. 

dvTl = aAdl^. 

Before there was a common fixed point on OhT^ and dyl^ ■ namely, origin = 

OhTI n dvTi-, but now the intersection 9^7?. n dyTZ is no longer a fixed point. The 
line segment ad still contains the origin as a fixed point. The line segment cb also 
contains a fixed point, but this is not so obvious. This will be a consequence of the 
following version of (8.7): 

(cb')P D lattice translate of cb' 

(8.11) 

^ ' (ad)P C ad; 

Verification of the first relation is more difficult than before. That the second 
relation holds is immediate because the translation was chosen to make it so. The 
first relation rests on the property of the partition that a, b, c, d are the only points 
on the contracting boundary which have equivalent ones on the expanding. There 
are no others! 



Recall d* = d' — (0, 1). Now because (d)P = a, the point a = (d*)P is a lattice 
translate of a and the line £^ is mapped to the line by P. See Figure 8.X. Let 
c, b, and d be the points on the line l'^ that are lattice translates of the points c, 
b', and d*. We must show that (cb')P contains cb. Because the point (d*)P lies 
on the line i'^, so do the points (c)P and (b')P. Since (b)P and (c)P arc contained 
in the segment ad and the pair of points (b')P and (c)P are lattice translates of 
neither a, b, c, nor d, this pair lies outside the segment be. Since (d*)P lies inside, 
the segment, (cb')P overlaps cb. 
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Figure 8.X. Image of expanding boundary of new principal Markov 
fundamental region. 

Now we are are in a position to prove that Figure8.IX(B) cannot occur. Let 
a = (P: 9) Since (d)P = a, we have that a = d'P. Let b be the point equivalent 

to b' on the line ^[f ''^ , upon which a lies; and let c be the lattice translate of c on 
the line . The segment cb is a lattice translate of the horizontal boundary 

of Rj U Rji. If b were below o on the line then by virtue of the fact that 

H is negative the point b'P would lie above {p, q) on this line, and hence above b. 
Since d'P = a lies below b, the parallelogram RjP would straddle the part of its 
own horizontal boundary: namely the segment cb. This would contradict the fact 
that i?/P is a subset of a fundamental region. 

Step 2. The family TZ* = {i?^ : 1 < < N*} consisting of all connected 
components of the sets Rif\(j)~^Rj e 7?. V (^"^7?. is Markov partition. 

Proof is the same as Step 2 of Case I. 

Step 3. The Markov partition TZ* is a generator. 

Proof, same as Case I, Step 3. 

Step 4. Let il={^Ri n Rj) denote the number of disjoint parallelograms in the 
intersection cpRi fl Rj. 



^icjyRi n Ri) = number of lines x = 0,x = 1, . . . , traversed by {Ri)P; 
#{(j)Ri n Rii) = number of lines y = l,y = 2, . . . , traversed by {Ri)P, 



which leads to 



#{(j)Ri n Ri) = x-coordinate of (0, 1)P = p 

#{4>Rii n Ri) = x-coordmate of (1, 1)P - (1, 0)P = q: 
i^{(j)Ri n Rii) = y-coordinate of (0, 1)P = r 

#{(l)Rii n Rii) = y-coordinate of (1, 1)P - (1,0)P = s. 



Argument similar to Case I, Step 4 but based on Figure 8. XI. There is a slight 
difference here. The vertical line segment aa'" is offset from the line a; = 0, and 



52 



SYMBOLIC DYNAMICS AND MARKOV PARTITIONS 




a =(0,0) +pvn 

Figure 8. XI. How the new {Ri)P and (i?//)P intersect various funda- 
mental regions 



a'" a" does not lie on y = 1. If pv^ = then 

#{^Ri n Ri) = number of lines a; — ^ = 0,a; — ^=1,..., traversed by {Ri)P; 
#{^Ri n Rii) = number of lines y — r] = l,y — r] = 2,..., traversed by {Ri)P, 

To get the desired result we observe that the parallelgram {Ri)P traverses the line 
a; — ^ = m if and only if it traverses the line x = m. Similarly the parallelogram 
traverses y — r] = m if and only if it traverses y = m. 

Step 5. The transition matrix associated with mapping of the Markov generator 
TZ* by (j) coincides with matrix which specifies the automorphism-namely, P- the 
edge graph of which is illustrated in Figure 8. V. 

Argument same as Case I Step 5. 

Case III: A < O./x > 0. 

The sets Ri are the same as for Case I. 

Step 1. The new partition TZ= {Ri, Rii} is Markov. 

This depends on establishing (8.6): namely 

cb' C (cb')P 
(ad)P C ad. 

The argument for the first relation is similar to that of Case II, Step 1. The 
argument for the second is the same as (8.7) of Case I. 

Step 2. The family TZ* = {RI : 1 < k < N*} consisting of all connected 
components of the sets Ri fl <p~^Rj gTZW (f)~^TZ is Markov partition. 

Proof is the same as Step 2 of Case I. 
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\ 



b 




(0,0) = a 



a = (-p-r,-q-s) 

Figure 8. XII. How {Ri)P and {Rii)P intersect various fundamental 
regions for Case III. 

Step 3. The Markov partition TZ* is a generator. 
Proof, same as Case I, Step 3. 

Step 4. Let #{4>Ri n Rj) denote the number of disjoint parallelograms in the 
intersection (j)Ri fl Rj. 

#{(t>Ri n Ri) = number of lines x = 0,x = —1, . . . , traversed by {Ri)P; 
ij^{(j}Ri n Rii) = number of lines y = —l,y = —2, . . . , traversed by {Ri)P, 

which leads to 



i^{<l>RinRi) 
#{(t>Rii n Ri) 
#{^Ri n Rh) 

#{^Rii n Rii) 



x-coordinate of (0, 1)P 
x-coordinate o/(l,l)P 
y-coordinate of (0, 1)P 
y-coordinate o/(l,l)P 



(1,0)P 



(1,0)P 



r; 



s. 



Argument the same as Case I, step 4 but based on Figure 8. XII. 
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Step 5. The fmnsition matrix associated with mapping of the Markov generator 
TZ* by (f) coincides with matrix which specifies the automorphism-namely, P- the 
edge graph of which is illustrated in Figure 8. V. 

Argument same as Case I Step 5. 

Case IV: A < 0,At < 0. 

The sets Ri are the same as in Case II- i.e., translations of the ones of Case I. 
Step 1. The new partition TZ = {Ri, Rn} is Markov. 
The argument is the same as in Case II, Stepl 

Step 2. The family TZ* = {RI : 1 < k < N*} consisting of all connected 
components of the sets Rif\(j)~^Rj G TZ\/ (f)~^TZ is Markov partition. 

Proof same as Case I, Step 2. 

Step 3. The Markov partition TZ* is a generator. 

Proof, same as Case I, Step 3. 




a = a - (-p-r,-q-s) 



Figure 8. XIII. How (-R/)P and (Rii)V intersect various fundamental 
regions Case IV 
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Step 4. Let 4j={<j)Ri fl Rj) denote the number of disjoint parallelograms in the 
intersection (j)Ri fl Rj . 

=ff{(f>Ri n i?/) = mmher of lines x = 0, x ^ —I, . . . , traversed by {Ri)P; 
^{(j)Ri n Rii) = number of lines y = —l,y = —2, . . . , traversed by {Ri)P, 

which leads to 



#{(l)Riir\Rii) = y-coordinate of (1,1)P - (1,0)P = -s. 

Argument the same as Case II, Step 4 but based on Figure 8. XIII. 

Step 5. The transition matrix associated with mapping of the Markov generator 
TZ* by (p coincides with matrix which specifies the automorphism-namely, P- the 
edge graph of which is illustrated in Figure 8. V. 

Argument same as Case I Step 5. 

□ 

Exercises. 

8.1 Let {X,(j)) be the dynamical system TZ = {Ri : i = 1,2,3} the Markov 
partition of Example 3.3. Show the partition S ~ {Si : i = 0, 1}, where 
So = RiU R2 and Si = R3, is Markov. Hint: S^^^ = TZ. Observe that 
associated directed graph G is vertex labelled graph of Figure 2. II. 

8.2 What is the maximum number of pre-images of n in the last example? In 
Theorem 8.4? 

9. Epilogue. The main theorem of Section 6 and its converse exhibit a du- 
ality between factor maps and Markov partitions. We are thus presented with a 
type of "chicken verses egg" question: which are more fundamental for getting 
concrete symbolic representations for concrete dynamical systems, Markov parti- 
tions or factor maps? In Example 3.2, we have with equal ease defined a factor map 
arithmetically producing from it a Markov partition and constructed a Markov par- 
tition thereby obtaining a factor map. However, in Section 8, we only constructed a 
Markov partition for an arbitrary hyperbolic automorphism of the two-dimensional 
torus. In trying to repeat the success of dimension two by drawing Markov parti- 
tions for hyperbolic toral automorphisms in dimension three, one is doomed to fail- 
ure as Bowen proves in [Bo2] . He shows that no point on the contracting boundary 
of a member of a Markov partition can there exist a tangent plane. The boiuidary 
is a forced to be a fractal. Even if such a figure could be constructed, rendering is 
certainly difficult, let alone incorporating it in a tiling of three space in a manner 
suitable for viewing. And then what about four dime ind higher?. There- 

fore, in any dimension other than two an arithmetic method seems the only hope. 
Kenyon, Vershik [KV], and Praggastis [P] attack the problem in this manner and 
obtain factor maps arithmetically for hyperbolic toral automorphisms. 



if{(j)Ri n Ri) = x-coordinate 
#{(j)Rii n Ri) = x-coordinate 
#{(j)Ri n Rii) = y-coordinate 



of (0, 1)P 

0/(1, 1)P 
0/(0, 1)P 



(1,0)P 



r; 
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Ultimately the answer to our chickcn-cgg question will probably turn out to be 
that neither takes precedence over the other. Bowen's proof of existence of Markov 
partitions and hence finite factor maps for general hyperbolic axiom A diffeomor- 
phisms supports this point of view [Bol]. In this work Bowen uses an argument 
employing a combination of both methods, and bootstraps his way to the desired 
result. It starts with a cover whose members have small diameter with respect to 
the expansive constant. A topological Markov shift is constructed by using the la- 
bels of the members as symbols and defining transitions according to the rule that 
the j-tli symbol follows the i-th if the image of the i-th member of the cover inter- 
sects the j-th. Using stable and unstable manifolds one is able to define a factor 
map from the symbolic shift to the phase space. This symbolic extension is much 
too big. The cardinality of a pre-images under this map can be non-denumerable. 
However, the images of cylinder sets form a second cover whose members are ab- 
stract rectangles obeying a Markovian property. This cover though may fail to form 
a topological partition for two reasons: some members may have no interior; and 
pairs of them may overlap in more than just boundary points. This cover is pared 
down by eliminating those sets with no interior. The overlap problem is solved by 
using some geometry of abstract rectangles: namely two abstract rectangles over- 
lapping in an open set can be partitioned into nonoverlapping abstract rectangles. 
The last step in getting a topological partition is to partition the members of this 
last cover abstract rectangles, no pair of which overlaps in an open set. Through all 
of this Markov property is still maintained by the members of this final partition. 
That this is a generator is gotten by the same argument as our Propostion 5.8. 

Returning to the line of investigation of Kenyon, Vershik, and Praggastis, there 
remain things to be understood, such as the connectivity of cylinder set images 
under arithmetically defined factor maps. Furthermore, the general case of Markov 
partitions for hyperbolic automorphims of n-dimensional tori has not yet been 
treated. 

References 

[ATW] Roy Adler, Charles Tresser, Patrick A. Worfolk, Topological conjugacy of linear endo- 
morphisms of the 2-torus, Trans. Amer. Math. Soc. ((to appear)). 

[AW] Roy L. Adler and Benjamin Weiss, Similarity of automorphisms of the torus, Memoirs 
American Math. Soc. 98 (1970). 

[Be] K. Berg, On the conjugacy problem for K-systems (1967), University of Minnesota. 

[Bol] R. Bowcn, Equilibrium States and the Ergodic Theory of Anosov Diffeomorphisms. Lec- 
ture Notes in Mathematics, Vol. 470, Springer- Verlag, Berlin, Heidelberg, New York, 
1975. 

[Bo2] R. Bowen, Markov partitions are not smooth, Proc. Amer. Math. Soc. 71 (1970), 130-132. 
[KV] R. Kenyon and A. Vershik, Arithmetic construction of sofic partitions of hyperbolic toral 

automorphisms, Ecole Normale Superieure de Lyon. 
[LM] D. Lind and B Marcus, An Introduction to Symbolic Dynamics and Coding, Cambridge 

University Press, Cambridge, New York, Melbourne, 1995. 
[I'l] B Praggastis, Markov partitions for hyperbolic toral automorphisms (1992), University 

of Washington. 

[W] R. F. Williams, The "DA" maps of Smale and structural stability, Proc. Symp. in Pure 
Math. 14 (1970), AMS, 329-334. 



